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TITLE: improved proteases and methods for producing them 
FIELD OF INVENTION 

r „ r ^ — * — Phases. The 

«-* a., jzztz ~ rr ^ and host ^ —■^ ,hese 

within animal feed. Pr ° dUC,n9 USin9 protea ^ *> P«**r. 

15 BACKGROUND 

B^r^: 2 io rtraT; t r **• j - m °^- ^ ™- ** j 

♦ wu, j o, ana Eur. J. Biochem. 1999 ora cm ccn. 
nomenclature is regularly supplemented and .,„h=, h ' respec " vel >'- The 

~ e., the Wodd W,de We. (WWW) a, 

' an*,o jr.T^rr; : ~t r-* ,hai * ° ne ° r 

resist a^ns, proteolytic degradation IT. ^ 

nonpar C-lerrainal taite of secreted proteins. ^ * reC ° 9ni2e 
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of nascent ^ induce * T" ^ ^ * ** C te ™"~ a 

33823). 9B 9 dURn9 t ' anSla,i0n '^nation. J.Bfal.Chem. 277:33825- 

SUMMARY OF THE INVENTION 

.- .hay may be £ ^ ~ ^ ™> ° a >° or 

m sub-optima, yiaida. Many om^Ll *"* ** "»—*-. 

understood a. Lml7Z££T?7 *° ** inSlab,ll,> ' ^ a " °' «"* ™ 

~* sa j^ m :rr u rr^r pro,eo,yuc - *— 

within the last four ami ™ L ^ . 1 1 nonw,r or ""charged polar amino acids 
iaas, three ncn-poia, "J^^T" Tha o,,„e a, 

• amino acid(s, m . tT 7 * « - ™» 

PO>ynuc 1 aoJ 1 oa, M anZll^::~ * - — * — - — « 
- -sang ^ ~ ~"» *— -V -id b a ,o mod^oneor 

configumdonsatftecamninua cr ; , ~ M!,PeP8da lh — p, ** r »*» 
yiaida as compared ,o £Z * " '~ ™* 

, ^zzzzzzr ,hese — ami "° - 

P-ea S aa^ i p^;t::rc' h ? VenBOn ' ~— h- 

in SEQ ID NO- 28 Z " ^T" "* ^ * ■"*"•*'• sh ~" 

orSEQ ID NO-,5 "* % "° '° N0: 3 * SEQ 10 Ntt «* « ID NO: 43; 



99% iden« ral lo ttetmtno l d ' ^ * % ' "* 97% ' 98% ' °' 

the ammo ecd sequence of the the mature part of the polypeptide 
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low. medlar, madlum-blob. hi9h . or very Wgh ™— - 

■ (0 a Pdynudeotide encode the malure ^ e( , 

P*~**» pbtainab* from gcnomlc dna <J £LZ 

tf ~ e ' DSM 43235 * use - «■« ~ 

f Q °« " ^ 35; !»— DSM ,5648 by use of 

Lr. ,D t 38 and * - - i— 

byu Seo fpnmeisS6QlDNOs:42a n d39- 

SI" 8 ,D NO: 32;of SEQ ,D N0: 3S - « 

(IV) a complementary strand of (I), (||), or (III)- 

NO: 4,; SEQ , T N0 : « ^ s E q 1 NO- 45 "* * ^ '° "* « SEQ '° 

(e) is an allelic variant of (a), (b). (c), or (d); or 
(f> is a fragment of (a), (b). (c), (d). or (e) 

rnT^Ta", p ?r* won9s ,o ,he szA ' ° r ,he sie 
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or e . f T 1 mVenti ° n ^ t0 3 tranS9enlC ' non - human anima '. ^ Products 

or e,ements thereof, comprising a polynucleotide as defined in the second aspect or an 

expression vector or polynucleotide construct as defined in the third aspect ' 

. . T h !T nth aSPSCt ° f 0,6 invention *> * method for producing a polypeptide as 

dl I"! Z r , 7* ' ^ meth ° d C ° mPriSin9: ^ CU,UVatin9 3 — ntZceU as 
tooodJ^ 

to produce a supernatant comprising the polypeptide, and optionally (b, recovering the 
. polypeptide. 

: Other aspects of then invention relate to: an anima. feed additive comprising at least one 

. polypeptide as defined in the first aspect; and 

(a) at least one fat-soluble vitamin, and/or 

(b) at least one water-soluble vitamin, and/or 

(c) at least one trace mineral; 

an animal feed composition having a crude protein content of 50 to 800 g/kg and 
compns, ng at least one polypeptide as defined in the first aspect, or at least one feed additive of 
the previous aspect; . auumveo. 

■ a composite, comprising at teas, one poiypep.de as defined in the first aspect, together 

Tec 2 Z™, T from "*as. <* 3.1.3.8 or 3.1.326) JanTse 

(EC 3.2.1.8). gatactanase (EC 3.2.1.89); aipha-galactosidase (EC 3.2.1.22); protease (EC 3 4 V 

EC?2 1 6 T A3>: Ph0SPh °' iPaSe ° <EC 3XAA) - "** <« 3.2,. « 

nutri U ona, m v!r u ! li 1" US ' n9 " ^ °~ P °' yPeP " de de " ned *• ** ** .he 

ZZT . " hy< "°' ySlS * h a "' mal «*» «* .he <rea«men, c* 

composition for use in animal feed; 

in., h* 3 r th °, d ^ ^ feaSt ^ P0,ypeptide 35 defined in the «■* aspect, comprising 
-ncludmg the polypeptide(s) in a detergent formulation. 9 

DETAILED DESCRIPTION OF THE INVENTION 

Proteases are classified on the basis of their catalytic mechanism into the following groups- 

uZTZZ ft Pr0t6aSeS ~ S ^proteases (M, and 

Unknown, or as yet unclassified, proteases (U). see Handbook of Proteolytic Enzymes. 
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Serin* proteases are ubiquitous, being found in viruses, baoleria and eukaryotes- thev 
KM. exopepbdase. endopepbdase. oBoopepUdase and omega-pep^dase a^Over 20 

our of the Cans (SA. SB, SC and SE,. these appear ,o be totatty unrefafed. suggest £ 
. our evotubonary origins o, serine peptidases. A,pna4y,,c endopepbdases beiong ,1 t he 
chyrno.ryp.s.n <SA, Can. v*hin which fney bave been assigned ,o subfan* A of the S2 J£ 

and is thTI- T""" SyS ' em °' P '° te0ly,iC ""»"- iS tesed « ■*"«*» 
: r^r T - « *— * bribed in Ratings. N.D. ei 

a. 2002 MEROPS. The protease database. Nucleic Adds Res. 30:343-346 The MEROPS 
database is freety avaiiabie eiecfronicatiy at M ^JJ^ 

S! tr^ Pr0 ' e0MiC ^ »52A,„TheHandbooKo, Proteose E ^ a re 

of o rd T*" " ™ P ' - — <""*<" ND ' B ™ « 0«» ^vlonTfa^ 
of peptidases. Biochem. J. 290:205-218). '»«°nary ramnies 

In particular embodiments, the proteases of the invention and for use acoordino to the 
.rwent.cn are selected from the group consisting of: ^ 

(a) proteases belonging to the EC 3.4.-.- enzyme group; 

(b) Serine proteases belonging to the S group of the above Handbook; 
(c1) Senne proteases of peptidase family S2A; 

Tl ■ Pr0teSSeS ° f P6PtldaSe fam,,y S1E 35 deSCribed in Hoei^mJ. 290:205-218 (1993) 

and ,n MEROPS a protease database, re.ease 6.20. March 24, 2003. (www.merops.ac uK T 
database ,s desenbed in Rawlings, N.D.. O'Brien, E. A. a Barrett. A.J. (2002) MEROPS ^ 
protease database. Nucleic Acids Res. 30. 343-346 M5ROPS. the 

«^rZTLT*r ^ ProteaSe 58 8 S6rine ^ 3 *"* S2A »~ 

reference ,s made to the above Handbook and the principles indicated therein Such 

de ter _ can be carried out for a„ types of proteases, be it nature!* occumngTl ^ 
proteases; or genetically engineered or synthetic proteases 

indudes^r T7 66 meaSUred ^ ln Which 3 substra te is empJoyed that 

•ncludes peptide bonds relevant for the specificity of the protease in question AssaJpH Z 
assay-temperature are likewise to be adapted to the protease in question Examp e^-pH 
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values are pH 2. 3. 4, 5, 6, 7, 8. 9, 10, ,1, or 12. Examples of assay-temperatures are 30 35 37 
40, 45, 50, 55, 60, 65. 70, 80, 90. or 95°C. 

Examples of protease substrates are casein, such as Azurine-Crosslinked Casein (A2CL- 
oase,n). Two protease assays are described in Example 2 herein, either of which can be used to 
determine protease activity. For the purposes of this invention, the so-ca..ed pNA Assay is a 
preferred assay. J 

There are no limitations on the origin of the protease of the invention and/or for use 
accord.ng to the invention. Thus, the term protease includes not only nature, or wild-type 

; Proteases obtained from microorganisms of any genus, but also any mutants, variants, fragments 
etc thereof exhibiting protease activity, as we., as synthetic proteases, such as shuffled 
proteases, and consensus proteases. Such geneticaHy engineered proteases can be prepared as 
-s genera.* known in the art. eg by Site-directed Mutagenesis, by PGR (using a PGR fragment 
conta,n,ng the desired mutation as one of the primers in the PGR reactions), or by Random 
Mutagenesu, The preparation of consensus proteins is described in eg EP 897985. The term 

- obtamed from" as used herein in connection with a given source shall mean that the polypeptide 
encoded by the nucleic acid sequence is produced by the source or by a cel. in which the nucleic 

..aad sequence from the source is presents a preferred embodiment, the po.ypeptide is secreted 
extracellularly. 

In a specific embodiment, the protease is a low-allergenic variant, designed to invoke a 
reduced .mmunological response when exposed to animals, including man. The term 
.mmunological response is to be understood as any reaction by the immune system of an animal 
exposed to the protease. One type of immunological response is an allergic response leading to 
mcreased levels of IgE in the exposed animal. Low-allergenic variants may be prepared using 
techniques known in the art For example the protease may be conjugated with polymer moieties 
sh,e,d.ng portions or epitopes of the protease involved in an immunological response. Conjugation 
wrth polymers may involve in vitro chemical coupling of polymer to the protease, e.g. as described 
•n WO 96/17929, WO 98/30682, WO 98/35026, and/or WO 99/00489.. Conjugation may in addition 
or alternatively thereto involve in vivo coupling of polymers to the protease. Such conjugation may 
be achieved by genetic engineering of the nucleotide sequence encoding the protease, inserting 
consensus sequences encoding additional glycosylate sites in the protease and expressing the 
protease ,n a host capab.e of g.ycosylating the protease, see e.g. WO 00/26354. Another way of 
proving low-allergenic variants is genetic engineering of the nucleotide sequence encoding the 
please so as to cause the protease to seff-oligomerize, effecting that protease monomers may 
sh,e.d the eptopes of other protease monomers and thereby lowering the antigenicity of the 
oligomers. Such products and their preparation is described e.g. in WO 96716177. Epitopes 



involved in an imn.unoiog.ca, responsa nay ba idanafiad bv various mathads such as ,ha phaua 

c ,^h ae w _ . Properties of the protease by known gene manipulation techniques 

oh as s,te ducted mutagenesis (see e.g. WO 00/26230, WO00/26354 and/or WO 00/22 o" 

acti^ T T V T BCt ° f ^ inVenti ° n re ' ateS t0 3 SeCret6d P ° ,y P e P fide which ^ protease 
cbvty preferably a,pha-.ytic endopepfldase activity, which poiypeptide comprises at .It three 

non-polar or uncharged polar amino acids within the last four amino acids of the C-terminus of the 
polypeptide, and which polypeptide: terminus of the 

^ Z ? i S Z ^ amin ° S6qUenCe WhiCh iS 31 ISaSt 70% ' or ***** 75%. 80% 
5%. 86%. 87%. 88%. 89%. 90%, 91% , 92% , 93 o /o , 9 < 0/o . 95%> 96% ^ 98 * 

LT* t0 amin ° aC ' d S6qUenCe ° f the mature P art <* ^ polypeptide shown 

' ' ^ 7w r 'Z ZT n ° aCid S6qUenCe ^ " 3t ,6aSt 70% ' ° r P^erably 750/0. 8£)% 
85%, 88%, 87o/ 0 , a8%, 89%. 90%. 91%. 92%. 93%. 94%. 95%. 96% 97% 98% or 

e^tl* 0 ^ 3min ° SeqU6nCe ° f ^ ^ matUre Part 0f the P*P*i* 
encoded by the polynucleotide in SEQ ID NO: 1; SEQ , D NO: 2; SEQ ID NO: 25; SEQ 

ID NO: 31; SEQ «D NO: 32; SEQ ,D NO: 36; SEQ ID NO: 40; or SEQ ID NO' 44- 

lowTeT ^ 3 r ,eiC S6qUenCe *"* hybridi26S Und6r ^ -edium- 
low, med,um, med.um-high, high, or very high stringency conditions with- 

(I) a polynucleotide encoding the mature part of a protease said 
polynucleotide obtainable from genomic DNA from Nocardiopsis 
dassonviUei subsp. dassonviUei DSM 43235 by use of primers SEQ ID 
NO's: 26 and 27; from Nocardiopsis Alba DSM 15647 by use of primers 
SEQ ID NOs: 34 and 35; from Nocardiopsis prasina DSM 15648 by use of 
pnmers SEQ ID NO's: 38 and 39; or from Nocardiopsis prasina DSM 15649 
by use of primers SEQ ID NO's: 42 and 39; 

(II) the polynucleotide of SEQ ID NO: 1; of SEQ ID NO: 2; of SEQ ID NO- 25- of 
SEQ ID NO: 31; of SEQ ID NO: 32; of SEQ ID NO: 36; of SEQ ID NO- 40- 
or of SEQ ID NO: 44; ' 

(III) a subsequence of (I) or (II) of at least 500 nucleotides, preferably 400 300 
200. or 100 nucleotides, or 
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(IV) a complementary strand of (I), (II), or (III); 

(d) comprises amature part which is a variant of the mature part of the polypeptide having 
the amino acid sequence of SEQ ID NO: 28; SEQ ID NO: 33; SEQ ID NO: 37; SEQ ID 
NO: 41; SEQ ID NO: 43; or SEQ ID NO: 45, comprising a substitution, deletion, 
extension, and/or insertion of one or more amino acids; 

(e) is an allelic variant of (a), (b). (c), or (d); or 

(f) is a fragment of (a), (b), (c), (d). or (e). 

For the purposes of the present invention, the degree of identity between two amino acid 
; sequences, as well as the degree of identity between two nucleotide sequences, is determined by 
the program "align" which is a Needleman-Wunsch alignment (Le. a global alignment). The 
program is used for alignment of polypeptide, as well as nucleotide sequences. The default 
scoring matrix BLOSUM50 is used for polypeptide alignments, and the default identity matrix is 
used for nucleotide alignments. The penalty for the first residue of a gap is -12 for polypeptides 
and -16 for nucleotides. The penalties for further residues of a gap are -2 for polypeptides, and -4 
< for nucleotide. . 

"Align" is part of the FASTA package version v20u6 (see W. R. Pearson and D. J. Lipman 
.(1988),-:"lmproved Tools for Biological Sequence Analysis", PNAS. 85:2444-2448. and W. R. 
Pearson (1990) "Rapid and Sensitive Sequence Comparison with FASTP and FASTA," Methods 
in Enzymology 183:63-98). FASTA protein alignments use the Smith-Waterman algorithm with no 
limitation on gap size (see "Smith-Waterman algorithm". T. F. Smith and M. S. Waterman (1981) 
J. Mol. Biol. 147:195-197). 

The degree of identity between two amino acid sequences may also be determined by the 
Clustal method (Higgins, 1989, CABIOS 5: 151-153) using the LASERGENE™ MEGALIGN™ 
software (DNASTAR, Inc.. Madison. Wl) with an identity table and the following multiple alignment 
parameters: Gap penalty of 10, and gap length penalty of 10. Pairwise alignment parameters are 
Ktuple=1, gap penalty=3, windows=5, and diagonals=5. The degree of identity between two 
nucleotide sequences may be determined using the same algorithm and software, package as 
described above with the following settings: Gap penalty of 10, and gap length penalty of 10. 
Pairwise alignment parameters are Ktuple=3, gap penalty=3 and windows=20. 

A fragment of one of the encoding polynucleotide sequences of the invention is a 
polynucleotide which encodes a polypeptide having one or more amino acids deleted from the 
amino and/or carboxyl terminus compared to the full-length amino acid sequence. In one 
embodrment a fragment encodes at least 75 amino acid residues, or at least 100 amino acid 
residues, or at least 125 amino acid residues, or at least 150 amino acid residues, or at least 160 
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polymorphs w«h,n popul lt J "T* ""^ ^ ^ " 

of a polypeptide » a poiypepsde encoded by en alMe variant of a gene 

sequences of r a ) ftrt or w k a selected from amongst the nucleic acid 

^ uoiw 43235 by use of primers SEQ ID NO's- 26 *nri «>7. *r jv/„ . 

Mm DSM 15647 by use of prime* SEQ ,D NOV 34 and 35 from T Jr 

15648 by use of primers SEQ ,D NOV 38 Z ^ f ' AteaW * ops « pras ™ ° SM 

use of primers SEQ ID NOV 42 and 3 f k AtocanSopsfc prtrs/na DSM ,5649 by 

nude* add probe to «J£ J^lT ' ^ ,here °'' "* — * **" a 

stains of different general s " j£ ^"7^ ^ haVin9 "** tem 

9 ara or speo.es according to methods well known in the art. In particular. 
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such probes can be used fo, hybridan with me genomic or cDNA of the genus or species of 
•merest Wl owrng mmaml Southem ^ ' pec.es of 

b* should be a, fee* , 5 , prefe rab,y „ fees, 25, end more p^erably a, ieas, 35 „„c Jd^ 
tan* Longer probes can afso be used. Bofh ONA and RNA probes can be used. The probes Je 
^cally iabeled for defacing the corresponding gene (,or example, with »P -H » S l2, Z 
evidin). Such probes are encompassed by the present invention ' 

Thus, a genomic DNA or cDNA library prepared from such other organisms may be 
•screened for DNA tha, nybrtdfces with *. probes described above and vlich encZsl 

stZ d b 9 ^ ^ °' ^ — *- "gen - may be 

31 39ar0 : " POlyaC ^ m ' de - or other separate Lnigues. 

Zrt iltTeT " T ra,ed " ™ y 69 *— — » « — » "^iiuioset 
D NO. 1 or a subseguence .hereof, the earner materia, is used in a Southern biot For P „rpe3 
*. Present mvemion. hybridan indues that me nucieio acid seguence hybnTeH a 
iabeied nuctaic actd probe co.espond.ng to the nucieic acid seguence shl i„s EQ 7 N 0- , J 
.. complement sb*nd, or a. subseguence .hemof. under ve*y tow to V e^ hfch sbingenc, 

zrjnr. : whfeh ,he nuddc acM proba *— — * 1= 

20 oonditior '""h !"*" °' a ' 100 " UCle0 '" ,eS to9,h ' ^ to ««Y ■** stringency 
oo d bo are deflned M prehybridizal|on ^ ^ .^^ ^ sx * uency 

J' d6na,Ured Sa ' m0n ^ 3 " d — 25% *— * » -f and o" 
stn gences. 35% lommmide for medium and medium-high stringencies, or 50% Lam,* " 

hrgh and very high atdngencias. ,o»o»ing s te „dam Soumem bto«ing procedu.s 

For ^ ■**« <* =< '-as. 100 nuclides in lengm. the oerriar materia, is finally washed 
hree tones each for ,5 minu.es using 0.2 x SSC, 0.2% SDS. 20% formamide preferably e. 

^mo^r^r^cThT 30 ' at ieast a eo ' c <medium - N9h — * 

30 high sblngency, < "* S, " n9enCy) ' " - « « « ^ 

a— F °' Sh ° rt , : Pr0beS ab0ul 15 nu *°W« to about 70 nucleotides in length, abingency 
conditions are defined as Drehx/hrirtrr^fi^ u ^ ^ wmgency 

10-0 belo» .he calcule^riT h IT ' ^«a«on a. 5'C «o 
Proceed™. T * calculate, according to Bolton and McCarthy (1062, 

3 5 p ~™ r r^n; u rrr in 09 m Naa ' 009 m *™ 

nk-40, 1X Denhardts soluHon, 1 mM sodium pyrophosphate, 1 mM 
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r:zrr^rr °- 1 mM atp ' and 02 - - »- rna - ■- — . -— 

For short probes about 15 nucleotides to about 70 nucleotides in tenon, th. ~ • 
materia, is washed once in 6X SSC p,us 0.1% SDS for 15 minutes nZZ 
usingSXSSCatSXtolO-CbetowtheoatculatedT^ 

The present invention also relates to variants of the pcjypepade of the invent!™, 
oon,p^n 3 esubs«u to .dete«oo.and,orinsertiono,oneor m oroaZaX ' 

In a particular embodiment, amino add changes are of a minor nature, that is conservative 
; ammo a. d suasions the, do no, s^oby anect the folding and/or aOivity o,7e pit 
small delebons typlcaiiy of one te about 30 amino adds; small antinc- or 

reatdues. or a small extension that facilitates purification by changing net charoe or anoHv, r 
. »^esapo^his«d 1 nau a c,.anan B ge™cep«opeo/ab,ndinldon,l 

t*~ and htsbdtne); actdtc amino acids (glutamic add and asparbc add,, polar amino adds 
•(glutamtne and asparaglne). hydrophobic amino acids Oeucine. isoleuclne nd valine, aroll - 
am.no ac,ds. (phenylalanine, t^ophan and tyrosine,, and small amino adds (tfydnt ~ 

si r ma,hi ° nine) - ^ "*--« -** * no, generaity atT ' 

r , ^u. Thr.e,. ^nr zz =srr;r:r 

LysfArg. AsrVAsn, Leu*. LeuAfal. AlafGlu. and Asp/Gly as we,, as these in reveroe ' 
In a particular embodiman,. ma polypepbdes o, me invamion and for use according ,0 me 
, enbon are aad-stebte. For me presen, purposes, the term add-s,able means the, 
ac**, after 2 hours o, incubabon a, p H 3.0 and 37X. is a, least 50%. as comparLTth 
refuel acbtrty of a corresponding sample incubated for 2 hours a, pH 0.0 and 5-C. ,„ a^u a 
embody, ma residua, activity is a, teas, 60%. 70%. 80% or a, leas, 00% 

In particular embodimente. the pdypepbde of me invention is i) a bacterial protease- in a 
protean o me pnyHum A«*tobac terf a; » 0 of me Cass *«U M HT« e 

pr^sH /I"' *• *"* «0 * »e 0-s Mx^U and/la 

w. tot example Nocardiopsis dassonvillei DSM 43235. 
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Garrttv I"! rT» ,a T ,,y " 8CC ° rd '" ,g '° ^ road ma » <° by G U 

? D h p = " ^ ManUa ' 0,S ' s,em * Bac.eHC.gy. 2001. second edition volume 
1 . David R. Bone, Richard W. Castenholz. 

5 .he JeT^TT *" " *• «»— ■» 'nvenaon encompasses bo.h 

ft. Per.eC and ..mperfec. aa.es. and omer taxonomic equivaiems, e.g., anamorphs. regardiess o, 
fte pec.es name by whfch mey are known . ^ sk „ |ed ,„ ^ J ^ - - 

idenbty of appropriate equivalents. 

• ^ S **"*~ ' heSe ^ readi ' y aCCeSSibto to «» " ""^er of cul«ur* 

. collecbons. such as me American Type Culture CotlecUon (ATCC) Deutsche Samm7u„7 
10 MiKroorganismen und ZeiiKulturen GmbH (DSM, Denfraaibul vl^ZZ^Z^ 

dZ ', rr PKS SOTnW " a ' «** DSM «* <* P"b«dy available from 

DSMZ (Deuteche Sammlung »on Mlkroorganismen und Z e,,Ku lt uren GmbH. Breunschvl 

Furthermore, such polypeptides may be identified and oblained from other' sources' 
l^T— f rom.na,ure <e.g.. S oi,,compos,s, wa.e,.e.c, usin a HZ 

mended probes. Techniques for iso«ing miooorganisms from na.ura, habiti are wes t™ 

20 cdIia h , "** "*" n " ^ *" 116 * h - * *** a genomH 

has JTd . 2 m,Cro °~ °~ a «— - sequence encoding a ^yp^ 

.eohnrques wh,ch are Known .o .hose o, ordinary sKii, in me art (see. e.g.. SambrooK e,a, 1^ 

» *T T* a " * iSO ' a,atr fe a PO'VPsp«de which is essentiaily free of 

ot, h ?T ' e ' 9 - " ^ ab ° U ' 2<>% PU ' e ' P ' efaral * 31 about «• P- mort 

TT, ^ — — prefarab * about 80% »- mm 

pure, and even mos. preferably about 95% pure, as detemrined by SDS-PAGE 
t_ ^"V m enC ° ded ^ nUOteiC ^ SeqUenCeS °* ' he preaart *o include 

N-tem^us or the C-,erm,nus o, the polypeptide or fragment thereof. A fused polypeptide is 
produced by fusing a nucieic acid sequence (or a portion thereof, encoding another po,^ 
a nude,c add sequence (or a portion thereof „, me presen, ^venHon. Techniques fo, pZZ 
u ,on po^pepedes are Known in th e art. e.g. PCR. or tigering the coding sequences eno^hl 
polypeptides so fha, fhey afe in frame and Ura, expression of fhe fused poiypephde is under 
5 confrol of the same promoterfs) and tenninator. Pa'yPephde ,s under 
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In the present context, non-polar amino acids are G. A. V, I. I, M, P, F or W and 
uncharged polar amino acids are S, T, N. Q, Y, og C. The terms "non-polar- and "uncharged polar" 
when used to describe amino acids in a polypeptide are generally recognized in the art as 
characterizing the side-chain of the amino acid. Hence, for instance, the free carboxylic acid of the 
c-terminal amino acid in a polypeptide is not considered when determining whether this amino 
acid is a non-polar or uncharged polar amino acid. 

A preferred embodiment releates to a polypeptide of the first aspect which mature part is a 
wildtype polypeptide; an artificial variant of a wildtype polypeptide said variant having one or more 
; amino-acid(s) added to the C-terminus as compared to the wildtype and preferably the one or 
more added.amino acid(s) is (are) non-polar or uncharged and even more preferably the one or 
more added amino acid(s) is one or more of Q, S, V, A, or P; a shuffled polypeptide; or a protein- 
engineered polypeptide. 

Another preferred embodiment relates to a polypeptide of the first aspect, wherein the one 
or more added amino acids are selected from the group consisting of: QSHVQSAP QSAP QP 
TL. TT, QL, TP, LP, Tl, IQ, QP ; P|, LT, TQ, IT, QQ. and PQ. 

The inventors determined, that the polypeptides ofthe present invention were produced in- 
.even :greater yields when they.were expressed as mature proteases fused to a heterologous pro- 
region, as shown in the examples below. 

Accordingly, a preferred embodiment relates to the polypeptide according to the 
first aspect which when expressed and before maturation comprises a heterologous pro-region 
from a protease; preferably the pro-region is derived from an S2A or S1E protease more 
preferably the pro-region is encoded by a polynucleotide which hybridizes under very low low 
medium-low. medium, medium-high, high, or very high stringency conditions with a polynucleotide 
encoding the pro-region shown in position -166 to -1 of SEQ ID NO: 28, in position 1-166 of SEQ 
ID NO: 30. in position -167 to -1 of SEQ ID NO: 33, in position -165 to -1 of SEQ ID NO- 37 in 
position -165 to -1 of SEQ ID NO: 41, in position -165 to -1 of SEQ ID NO: 43, in position -165 to - 
1 of SEQ ID NO: 45, in position 1-165 of SEQ ID NO: 46. in position 1-166 of SEQ ID NO" 47 in 
position 1-166 of SEQ ID NO: 48. in position 1-166 of SEQ ID NO: 49. in position 1-166 of SEQ ID 
NO: 50, in position 1-165 of SEQ ID NO: 51, in position 1-166 of SEQ ID NO: 52, or in position 1- 
166 of SEQ ID NO: 53; and most preferably it is at least 70% identical, or preferably 75% 80% 
85%, 860/,. 87%. 88%. 89%. 90%. 91%. 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, identical 
to the pro-region shown in position -166 to -1 of SEQ ID NO: 28. in position 1-166 of SEQ ID NO- 
30. in position -167 to -1 of SEQ ID NO: 33. in position -165 to -1 of SEQ ID NO: 37. in position - 
165 to -1 of SEQ ID NO: 41 . in position -165 to -1 of SEQ ID NO: 43, in position -165 to -1 of SEQ 
ID NO: 45, in position 1-165 of SEQ ID NO: 46. in position V166 of SEQ ID NO: 47. in position 1- 
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166 of SEQ ID NO: 48. in position ,.,66 of SEQ ID NO: 49. In position 1-166 of SEQ ,D NO- 50 in 
p-» of SEO ID NO: 5, . in position ,.,66 of SEO ,D NO: S 2 . or in posi«on »Z « 

When the particular Cteminal amino add conjuration of the polypepnde of me invert™ 
was coined „*h an heterologous secna»on signal pepdde fused ,o ZLntrl £TZ 
po,ypep M e o, the Benton, a synergy was achieved end a greater yield resulted 

Accordingly, a preferred embodiment of the invention relates ,o the polypeptide of the Are, 

. from the potypepbde when ,he polypepdde is secreted, preferarty the heterologous section 
,gna peptide ,s dedved from a heterelogous protease: preferably the heteroiogous secret 
argna, peptrde comprtses an an.no add sequence having a seguence Identity o, JZa 7 T2 

rrnr ™i m - **■ ^ 9o% ' ^ -* ££££ 

or SEQ ,D NO «. SeqUe " re ~ * «— «- 1 " « - 5E0 -D NO: 2. 

Nucleic-Acid Sequent 

* • The present Invention also relates to Isolated nucleic add sequences that encode .'• 
poypepude o, the presentinvenuon. Particular nudeic add sequences o, .nelson ^he 
polynudeotides 0, SEQ ID NO: ,. SEQ ID NO: 2. SEQ ID NO: 25. SEQ ID NO 3^ SEQ B £ 
« •» ID NO: 36. SEQ ID NO: 40. or SEQ ,D NO: 44. Another partita nude. c^ZZ 

432^ oT 9en0m ' C A ^ <"» DSM 

43235. The present .nvertion also encompaases nudeic add seguenoes whteh encode a 

PcWde : having ,he amino add seguence o, amino acids shown * poaHIo^ Z£ 0 

posfcons ,65 ,o ,60, o, SEQ ID NO: 43. which nucleic acid sequences tJtZZ 

correspondrng parts of SEQ ID NO: , by virtue o, the degeneracy o, the genelic code The TesT™ 

ZZ 7,7" '° SUDSe,UenCeS * " *» *~ «** encode^ ptt 

fragments that have protease activity. /H^irae 

nudeotlTT 0 ' * ' POlynUC,e0 » de ls « — eequence ton, which one or more 

"** 3 ' has been deletei Preferabi * a «— * 

50 50 sHnn J""' * teaSl 300 — Pfeferabty a, teas, 375 

2*2, « ■ OT 1000 ~ eS ' The — * 

SEQ ID NO. 2. SEQ ID NO: 25. SEQ ID NO: 31. SEQ .0 NO: 32. SEQ ,D NO: 36. SEQ ID NO- 40 
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or SEQ ID NO: 44 of at least 85%, 86, 87, 88, 89, 90, 91, 92, 93, 94. 95, 96, 97, 98, or at least 
99%. 

The techniques used to isolate or clone a nucleic acid sequence encoding a polypeptide 
are known in the art and include isolation from genomic DNA. preparation from cDNA, or a 
combination thereof. The cloning of the nucleic acid sequences of the present invention from such 
genomic DNA can be effected, e.g., by using the well known polymerase chain reaction (PCR) or 
antibody screening of expression libraries to detect cloned DNA fragments with shared structural 
features. See. e.g., Innis etal., 1990, PCR: A Guide to Methods and Application, Academic Press. 
; New York. Other nucleic acid amplification procedures such as ligase chain reaction (LCR); 
ligated activated transcription (LAT) and nucleic acid sequence-based amplification (NASBA) may 
be used. The nucleic acid sequence may be cloned from a strain of Nocardiopsis or another or 
related organism and thus, for example, may be an allelic or species variant of the polypeptide 
encoding region of the nucleic acid sequence. 

The term "isolated nucleic acid sequence" as used herein refers to a nucleic acid sequence 
which is essentially free of other nucleic acid sequences, e.g., at least about 20% pure, preferably 
at least about 40%-pure, more preferably at least about 60% pure, even more preferably at least . 
about 80% pure, .and most preferably- at least about 90% pure as determined by > agarose 
electrophoresis. For example, an isolated nucleic acid sequence can be obtained by standard 
cloning procedures used in genetic engineering to relocate the nucleic acid sequence from its 
natural location to a different site where it will be reproduced. The cloning procedures may involve 
excision and isolation of a desired nucleic acid fragment comprising the nucleic acid sequence 
encoding the polypeptide, insertion of the fragment into a vector molecule, and incorporation of 
the recombinant vector into a host cell where multiple copies or clones of the nucleic acid 
sequence will be replicated. The nucleic acid sequence may be of genomic. cDNA, RNA. 
semisynthetic, synthetic origin, or any combinations thereof. 

Modification of a nucleic acid sequence encoding a polypeptide of the present invention 
may be necessary for the synthesis of polypeptides substantially similar to the polypeptide. The 
term "substantially similar to the polypeptide refers to non-naturally occurring forms of the 
polypeptide. These polypeptides may differ in some engineered way from the polypeptide isolated 
from its native source, e.g.. variants that differ in specific activity, thermostability. pH optimum, 
allergenics, or the like. The variant sequence may be constructed on the basis of the nucleic acid 
sequence presented as the polypeptide encoding part of the polynucleotides of the invention, e.g. 
a subsequence thereof, and/or by introduction of nucleotide substitutions which do not give rise to 
another amino acid sequence of the polypeptide encoded by the nucleic acid sequence, but which 
correspond to the codon usage of the host organism intended for production of the protease, or by 
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mtroduction of nucleotide substitutions which may give rise to a different amino acid sequence 
For a general description of nucleotide substitution, see, e.g.. Ford et a/., 1991, Protein 
Expression and Purification 2: 95-107. ^allergenic polypeptides can e.g. be prepared as 
described above. 

It will be apparent to those skilled in the art that such substitutions can be made outside 
the regions critica. to the function of the molecule and still result in an active po.ypeptide Amino 
acd residues essentia, to the activity of the polypeptide encoded by the isolated nudeic acid 
sequence of the invention, and therefore preferably not subject to substitution, may be identified 
; according to procedures known in the art. such as site-directed mutagenesis or alanine-scanning 
mutagenesis (see, e.g., Cunningham and Wells. 1989. Science 244: 1081-1085) In the latter 
technique, mutations are introduced at every positively charged residue in the molecule and the 
resultant mutant molecules are tested for protease activity to identify amino acid residues that are 
cntrcal to the activity of the molecute. Sites of substrate-protease interaction can also be 
determined by analysis of the three-dimensional structure as determined by such techniques as 
nuclear magnetic resonance analysis, -crystallography or photoaffinity labelling (see, eg de Vos- 
* 3A..1992, Science 255: 306-312;.Smith et a/...1992, Journal of Molecular Biology 224- 899-904- ■ 
• Wlodaver et a/., 1992. FEBS Letters 309:. 59-64).. . . . 

The present invention also relates to isolated nucleic acid sequences encoding a 
polypeptide of the present invention, which hybridize under very low stringency conditions 
preferably low stringency conditions, more preferably medium stringency conditions more 
preferably medium-high stringency conditions, even more preferably high stringency conditions 
and most preferably very high stringency conditions with a nucleic acid probe which hybridizes 
under the same conditions with the nucleic acid sequence of the invention or its complementary 
strand; or allelic variants and subsequences thereof (Sambrook et a/., 1989. supra), as defined 
herein. 

The present invention also relates to isolated nucleic acid sequences produced by (a) 
hybridizing a DNA under very low. low, medium, medium-high; high, or very high stringency 
conditions with (i) a polynucleotide of the invention, (ii) a subsequence of (i), or (Hi) a 
complementary strand of (i). or (ii); and (b) isolating the nucleic acid sequence. The subsequence 
» preferably a sequence of at least 100 nucleotides such as a sequence that encodes a 
polypeptide fragment which has protease activity. 

The introduction of a mutation into the nucleic acid sequence to exchange one nucleotide 
for another nucleotide may be accomplished by site-directed mutagenesis using any of the 
methods known in the art Particularly usefu. is the procedure that utilizes a supercoiled. double 
stranded DNA vector with an insert of interest and two synthetic primers containing the desired 
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mutauon. The okgonudedide prime*, each complementer, ,o opposite s,rands of the vector 
extend dunng tempera,ure cycling by n^.ns of Pfu DNA polymerase. On incorporaBon of <he 
pnmers a mutated piasmid containing staggered nicka ia generated. Following temperature 
cydtng the product is treated w»h Dpn , „ hich „ specie for moated and herniated M 
to diges, the perentat DNA temptate and to aeiect for mutatawx.ntaining synthesized DNA Ofher 
procedures known in the an may atso be used. The InvenHon aiso rotates ,o an iso^, 
polynudeot.de encoding a polypeptide as defined in the first aspect. 

; Nucleic ArJ d Constructs 

The present invention aiso relates to nucleic acid constructs comprising a nudeic acid 
sequence of the present invention operabiy linked to one or more conhd sequences that direct 
* - ^g sequence in a su*ab,e host cel, under ccdiKons compatible^ 
conW sequences. Exp re ssion win be understood to include any step involved in the product o, 

• IteTon trr 9 ' " "* ™" * i-W— —Won 
translation; post-translational modification, and secretion. 

" h ' k, '"T°T ** C ° nSln,Ct " ' S deflned here ' n " a nx»ecule. aither single- or 

doub,e,stra„ded..„h,ch is isolated,™ a.„a,ura»y.occurring gene or whid, has been mod^f to . 

Zl r ^ " Syn0nym ° US W « h *• •«» «*<"<*™ cassett. 

whe the nude.0 acd construe, contains a» the control sequences mquited tor expresston o, a 
co ng sequence of the present invention. The term "coding sequence- is defined hemin as a 
nuciarc a a d sequence that directly spedfes ft. amino add sequence * ite protein product The 
boundaries of the coding sequence a re generally determined by a ribosonre binding^ 
- by .he ATG star, oodon (euk ^,es> located Jus, ups,ream o, ate op e n l," 
frame a, ,he , and oMhe mRNA and a transcript tenninator sequence ,oca,ed Jus, downs lra am 

limited to, DNA. cDNA, and recombinant nudeic add sequences 

mani„jl!r ted nUC ' eiC emodi ^ * "»*PMF" - •» presen, invenUon may be 

mm**m ,n a vanety of ways t0 provide for MpIBSSion of ,„ e ^ " 

nud«c acd sequence poor to Ms insertion into a vedor m ay be desirabte or necessary depending 
on me expression vedor. The techniques for modifying nucleic add sequences utilizing 
recombinant DNA methods are well known in the art. 

The term -contro, sequences" is defined herein to indude all componente that ate 

con.ro, sequence may be nafive ortoreign to the nudeic add sequence encoding the polypeptide. 



10423.020-OK 



18 



15- 



20 



25 



30 



35 



™T • ^ n °' " mited * ' leat,er - P*— **» -.uence. 

SZTdTT? promo,er ' ^ pepnde sequence ' and *— **» »— • « • 
ZTV" TT sequences ino,ude a promo,er ' and **- — ■—*«- 

agnate. The con.ro, sequences may be provided with .inkers for me purpose of introducing 
spec* reason sites hd«a« na ^ of ^ Mntrol sequenees ^ ^ « 

contuT enC ° d ' n9 3 PO ' yPeP " de - ^ ^ ""^ is — •** - * 

cnfigu*™ ,n wh,ch a oon.ro, sequence is appropriate* placed a. a position reiative to the 

* «ha DMA sequence such the, the contra, sequence directe , he expression of a 

The contro, sequence may be an appropdate promote, sequence, a nudeic acid sequence 
-ha, ,s receded by a hos, ce,, for expression of the nuCaic acid sequence. The pLZ 
sequence contains banacripbona. controi sequences ma, mediate the exprcaj oTZ 

m the hos, eel, of cho.ee rnduding mutant, buncated. and hybrid prorate*, and may be obta Jo 

ZXZZTT 0 "T"* orln,raM " ar po ^ ep,ides aitt,ar hamoi °^ - 

of'L •/ Xa 7' eSO ' S ^ ableprOTOters,0 ' dl ' e < : «"S'''e transcription o„he nudeic add conebucte 
Ofdte present ^„ to „. especia„y in a badena, hos, ce,,. are me promoters ob,a,nad bom ,ha E 
«* lac operon. Sfreptomyoes coe/fcofor agarase gene (dagA), 8ad»ua suSMs tevansucrase gene 

m2eT a ' Pha - amylaSe ^ ^ — — «*- 

gene (amyM,, 8aci.ua am^uefaCens aipha-amytase gene (amyQ, bLus 

pen ra ,„nasa gene <pen P ). Badte suUm xtfA and xy,B genes, and proxaryobc 

bala-lacamase gene (Villa-Kamaron e, al.. ,978. Proceedings o, me Naliona, Academy o, 

meTT Z 3?27 - 3731) ' aS M " " ^ ' aC ~ (DeBoer e, a,.. 198 3, Pr^Tgs 
he Nabonal Academy o, Sdences USA 60: 21-25). Further promoters are described in -UseM 

a"eZT mbinam bac,ena " in Scien ™ c American ' 1980 ' 242: 74 - 9 « a " d * *"*»* « 

Examples o, suitable promoters for direebng me banacripbon of me nucleic add oonsbucte 

VZTT 3 fflamen ' 0US ' Un9a ' h0S ' ^ are Pr ° ra0tere •« *• 9enea 

T - W P-teinase. 

T "'^ ^ ^ a "*=-amy,ase; Aspe^ra nj or 

~ \ 7 9lUCOam! " aSe «**—■-** "Paae. Aspe^os oo*ae atxaline 

P^ase. Aspe^us o^ae triose phosphate isomerase. Aspergillus n/dbfana ace^midase. and 

hybnd o, ,he promoters bom ,he genes ,or Aspe^us n/ger neu, ra , a,pha^my,ase and 
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ZT* ™ m Mose phospha,e tan "~* and ,nmca,ed ' a " d ~» 

In a yeas. host, useful promoters are obtained from the genes for Seccbarcmycea 
cerewsree enolase (ENO-1). Sacc^yces cemt«ae galactokinase * (GAL1 ), Sacctenjycal 
5 cerewsrae alcohol dehydro,enase/g,yceraldehyd*3.phospha.e dehydrogenase (ADH2/GAP) and 

aredescnbedbyRomanosefa/.. 1992, Yeas, 8: 423-488. 

The control sequence may also be a suitable transcription terminator sequence a 
.-sequence recognized by a host ce. to terminate inscription. The terminator sequence' is 
10 operabfy „nked to me y terminus o, me nucteic acid sequence encoding the potypep.de £ 
terminator which i, funcfiona, in me hoe, ceii o, choice may be used in me present invenl 

Pnrferred terminators for filamentous fungal host cells ate obtained from the genes for 
Aapergy oneae TAKA amylase. Aaperg** nlger glucoamylase. A^ua Nutans 

* HT ~' ~ s .T***"*"' 1 and Fusartum °"~ " 

• ' ce«w2T? ,em T 0r5 ^ h ° 5 ' "* 3,6 ° b,a '" ,ed ,r ° m me «~ Sacc/rammycea • 
" ZZ T f^™*"^ «— — «*-"«- C (CYC,,, and Saccha™,^ 

clZH ^f*^'''™'*** dehydrogenase, Other useful terminate, for yeas, host 

cells are described by Romanos el al. , 1 992, supra. 

^0 Preferred tenninatora for bacterial host cells, such as a BadBus host cell are the 

SZELT T" **"*"■* alPha - amylaSe ^ane (arnyL). me Ba*s 

zzzzzzrr amy,ase eene <amyM) ' - - *— 

'8 en m JUT SK ' UanCe a ' M 3 SU * able ' eader SKluence ' ' "°*ansla,ed region of 

Unked to me 8 termrnus of the nucleic ecid sequence encoding me polypeptide. Any leader 
sequence ma, is funcbona, In the host cel, o, choice may be used in the prL* invention 

Prefened leaders for filamentous fungal hoe, celle are obtained from the genes for 
-W'-^aeTAKAamylaseandAape^ua^^trtosephosphateisome^e 

«» • I"*" ^ h0!t ^ a ' e 0b,ained ftom *» *» Sacchammycea 

catenae enolase (ENC ,„, SaccW^ cere./stee 3-phosphoglycerata kinase, 
Saccbaromyces cen^ae alpha-facte, and Sacc/taromyces catenae alcohol 
dehydrogenase/glyceraldehyde-3-phosphate dehydrogenase (ADH2/GAP) • 

5 ,,„k.d ,1* TT • Sa< " ,e ' ,Ce ^ a ' S0 ^ 3 ""^"^ ^uenoe. a sequence operably 
linked to the 3 terminus o, the nucleic acid sequence and which, when transcribed, is recognized 
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lUTr " " 3 S ' 8na ' '° •»*-«—'"«-*-. to inscribed mRNA Any 



5 ,h» rJ'TI POlVadeni " a,ion "*""«« «» "a-nentous fungal host cells are obtained from 
the gene, for Aspergillus cry.ee TAKA amylase. Aspergillus mger glucoamylase A^Z 

■rrrr s ~ — ~ — -err 

ntZT T ,0 ,he amlno ,em,inus °' a po ^ lde and d *~* - «— 

m.o the ce» s secretory pathway. The 5' end of .he coding sequence of ma nudeio add sequence 
™ y nheren,,, contah, a signa, peptide coding region nature,,, ,inKed in translation ree^ZZ 

S«d of the cod m g sequence may contain a signa, peptide coding region which i, foreign to the 

.trrzr. T'rr *~ ™> »* «— — srr 

■ ^aquence does not na,ura.y. contain a signa, pepade coding region. .Alternative* ., the foreign 
e,g„a pepbde coding region may simpiy rep ,ace the naura, signa, peptide codir* 2££Z 
o enhance secretion of the polypeptide. However, any signa, peptide coding rjon Ich dire* 
0 .he =x , eased pCypepW. ,„,o me aeonatory pamway o, a host ce„ o, cho,oe may be ultS 
present invention. . y 0 ,n me 

Effecsve signa, peptide coding regions for bacterial host cells are .he signal pepade codino 
regtons obtained from the genes for flacks NCI8 ,1637 maitogenic oraTsT B^Z 

5 rr zrrr*"* e. ^r*r: 
rjs^T pe * are * simonen and pai - 1 ^ 

EM, signa, pepede coding regions for fi,amen,ous funga, host ce«s are me signs, 

nrger n utra, my,ase. Aspergillus nlger g.ucoamy.ase. ^ aspa(Sc ' .J^se 

Humrcdaursdens cellulose, and Mmicofe fanugfnosa «pase Polemase. 

cere JTT *"* ^ °* " ^ - *» Sa ~ »~ 

cerevrsrae alpha-factor and Saochammyces cerev/Sae invertase. Char useful signa, peptide 

cod,ng regions are desedbed by Romanes af a/., 1992. supra. 
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The control sequence may also be a propeptide coding region that codes for an amino acid 
sequence portioned at the amino terminus of a poiypeptide. The resultant polypeptide is known 
as a proenzyme or propeptide (or a zymogen in some cases). A polypeptide is generaily 
.nactive and can be converted to a mature active po.ypeptide by catalytic or autocatalytic cleavage 
of the propeptide from the polypeptide. The propeptide coding region may be obtained from the 
genes for BacVus subWs alkaline protease (apr£), BacWus sums neutral protease (nprT) 
S*~yces cewsiae alpha-factor, ^omuoor m i eh ei aspartic proteinase, and 
Mycefiophthora thermophila laccase (WO 95/33836). 
; . Where both signal peptide and propeptide regions are present at the amino terminus of a 
polypeptide, the propeptide region is positioned next to the amino terminus of a polypeptide and 
the s,gnal peptide region is positioned next to the amino terminus of the propeptide region 

It may also be desirable to add regulatory sequences which allow the regulation of the 
expression of the polypeptide relative to the growth of the host cel.. Examp.es of regulatory 
systems are those which cause the expression of the gene to be turned on or off in response to a 
chem.ca, or physica. stimulus, including the presence of a regulatory compound. Regulatory- 
-systems ,n prokaryotic.systems include the ,ac, tac, and trp operator systems. In .yeast, the ADH2 
. -system or GAU system may be used. ln ; filamentous fungi, the TAKA alpha-amylase promoter < 
AspergMus nzgerg.ucoamy.ase promoter, and AsperglHus oryzae g.ucoamyiase promoter may be 
used as regulatory sequences. Other examp.es of regulatory sequences are those which a.low for 
gene amplification. In eukaryotic systems, these include the dihydrofolate reductase gene which is 
amphfied ,n the presence of methotrexate, and the metalbthionein genes which are amplified with 
heavy metals. In these cases, the nucleic acid sequence encoding the polypeptide would be 
operably linked with the regulatory sequence. 

Expression Vectors 

The present invention also relates to recombinant expression vectors comprising a nucleic 
acd sequence of the present invention, a promoter, and transcriptional and'translationa. stop 
agnate. The various hudeic acid and contro. sequences described above may be joined together 
to produce a recombinant expression vector which may include one or more convenient restriction 
s.tes to a..ow for insertion or substitution of the nucleic acid sequence encoding the polypeptide at 
such sites. Alternatively, the nucleic acid sequence of the present invention may be expressed by 
.nserting the nucleic acid sequence or a nucieic acid construct comprising the sequence into an 
approbate vector for expression. In creating the expression vector, the coding sequence is 
.ocated ,n the vector so that the coding sequence is operab.y .inked with the appropriate contro. 
sequences for expression. 
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The recombinant expression veolor may be any vector (e.„., a piasmld or virus) which can 
be convene subjected to recombinant DNA procedures and can bdno about me expression „ 
the nude,c ecd sequence. The choice of the vector wil, typically depend on the compatibility of the 
vector w„h the host cel, into which the vector is to be introduced. The vectors may be Lr o, 
closed circular plasmids. 

The vector may be an autonomously replicating vector, /.e., a vector which exists as an 
extrachrcmosoma. entity, the repHcation of which is independent of chromosomal replication e a. 
a plasmid, an extrachromosomal element, a minichromosome. or an artificial chromosome The 
vector may contain any means for assuring self-replication. Alternatively, the vector may be one 
which when introduced into the host cel.. is integrated into the genome and replicated together 
with the chromosome(s) into which it has been integrated. Furthermore, a single vector or p.asmid 
or two or more vectors or plasmids which together contain the total DNA to be.introduced into the 
genome of the host cell, or a transposon may be used. 

The vectors of the present invention preferably contain one or more selectable markers 
wh,ch perm, easy selection of transformed cells. A sdectable marker is a gene the product of 
•• -which provides for biocide or viral resistance, resistance to heavy metals, prototrophy to 

" ■ sirrr? 6 - Examp,es ofbacteriai se,ectawe - *■ - 

ZT^Tl - SUitaWe markSrS f ° r ^ h ° St Ce " S are ADE2, H.S3, LEU2, 

20 include Z ; and ^ Se,6Ctab,e ^ ^ ^ in 3 fi — S « -I, 

20 nch.de . but are not mm to, amdS (acetamidase), ergB (ornithine carbamoy«transferase,, b er 

(phosphinothncin acetyltransferase), hygB (hygromycln phosphotransferase). n/aD (nitrate 

reductase) pyrG (orotidine-5'-phosphate decarboxylase). sC (sulfate adenyltransferase), trpC 

the T n r e TT " " CqUiVa,entS there ° f - Preferred f ° r USe an ***** ~» - 
the amc/s and pyrG genes of Aapmgmm n . duians ^ Asp ^ ifius ^ ^ ^ ^ 

25 Streptomyces hygmscopicus. 

The vectors of the present invention preferably contain an elements) that permits stable 
.n tegration of the vector into the host cell's genome or autonomous replication of the vector in the 
cell independent of the genome. 

For integration into the host cell genome, the vector may rely on the nucleic add sequence 
encoding the polypeptide or any other e.ement of the vector for stable integration of the vector into 
the genome by homologous or nonhomologous recombination. Alternatively, the vector may 
contain additional nucleic acid sequences for directing integration by homologous recombination 
into the genome of the host cel.. The additional nucleic add sequences enable the vector to be 
integrated rnto the host cell genome at a precise location**) in the chromosome(s). To increase 
the ..ke.ihood of integration at a predse location, the integrations elements should preferably 
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contam a sufficient number of nucleic acids, such as 100 to 1,500 base pairs, preferably 400 to 
1.500 base pairs, and most preferably 800 to 1.500 base pairs, which are highly homologous with 
the corresponding target sequence to enhance the probability of homologous recombination The 
■ntegrafonal elements may be any sequence that is homologous with the target sequence in the 
genome of the host cell. Furthermore, the integrational elements may be non-encoding or 
encoding nucleic acid sequences. On the other hand, the vector may be integrated into the 
genome of the host cell by non-homologous recombination. 

For autonomous replication, the vector may further comprise an origin of replication 
; enabling the vector to replicate autonomously in the host cell in question. Examples of bacterial 
ongms of replication are the origins of replication of plasmids pBR322. pUC19. pACYC177 and 
PACYC184 permitting replication in £. coli, and pUB110. pE194. pTA1060. and pAMB1 permitting 
replication in BaaUus. Examples of origins of replication for use in a yeast host cell are the 2 
m,cron origin of replication. ARS1. ARS4. the combination of ARS1 and CEN3 and the 
combination of ARS4 and CEN6. The origin of replication may be one having a mutation which 
- makes ,t functioning temperature-sensitive in the host ceJ..(see. e.g.. Ehrlich. 1978. Proceedings of 
\ the National Academy of Sciences.USA 75: 1433). 

More than one copy of a nucleic acid sequence of the present invention may be inserted 
.nto the host cell to increase production of the gene product An increase in the copy number of 
the nucleic acid sequence can be obtained by integrating at least one additional copy of the 
sequence into the host cell genome or by including an ampfifiable selectable marker gene with the 
nucle,c acid sequence where cells containing amplified copies of the selectable marker gene and 
thereby additional copies of the nucleic acid sequence, can be selected for by cultivating the cells 
in the presence of the appropriate selectable agent. 

The procedures used to ligate the elements described above to construct the recombinant 
express.cn vectors of the present invention are well known to one skilled in the art (see eo 
Sambrook et a/., 1989. supra). ' 

The protease may also be co-expressed together with at least one other enzyme of 
-merest for animal feed, such as phytase (EC 3.1.3.8 or 3.1.3.26); xylanase (EC 32 18V 
galactanase (EC 3.2.1.89); alpha-ga.actosidase (EC 3.2.1.22); protease (EC 3 4--)' 
phospholipase A1 (EC 3.1.1.32); phospholipase A2 (EC 3.1.1.4); lysophospholipase (EC 3 1 1 5)' 
phospholipase C (3.1.4.3); phospho.ipase D (EC 3.1.4.4); and/or beta-o.ucanase (EC 3.2 1 A or 
EC 3.2.1.6). 

The enzymes may be co-expressed from different vectors, from one vector, or using a 
mixture of both techniques. When using different vectors, the vectors may have different 
selectable markers, and different origins of replication. When using only one vector, the genes can 
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be expressed from one or more promoters. If cloned under the regulation of one promoter (*. or 
multwcistronic). the order in which the genes are cloned may affect the expression levels of the 
proteins. The protease may also be expressed as a fusion protein, i.e. that the gene encoding the 
protease has been fused in frame to the gene encoding another protein. This protein may be 
another enzyme or a functional domain from another enzyme. 

Accordingly, the invention also relates to a recombinant expression vector or 
polynucleotide construct comprising a polynucleotide of the invention. 

- Host Cells 

The present invention also relates to recombinant host cells, comprising a nucleic acid 
sequence of the invention, which are advantageously used in the recombinant production of the 
polypeptides. A vector comprising a nucleic acid sequence of the present invention is introduced 
into a host cell so that the vector is maintained as a chromosomal integrant or as a self-replicating 
extra-chromosomal vector as described earlier. The term "host cell" encompasses any progeny of 
a parent cell that is not identical to the parent-cell due to mutations that occur during replication 
The choice of a- host cell will to- a large -extent depend upon the gene encoding the polypeptide 
and its source. The host cell may be a unicellular, microorganism, e.g.. a prokaryote, or a non- 
unicellular microorganism, e.g., a eukaryote. 

Useful unicellular cells are bacterial cells such as gram positive bacteria including, but not 
l.m,ted to. a Bacillus cell, or a Streptomyces cell, or cells of lactic acid bacteria; or gram negative 
bactena such as E. coll and Pseudomonas sp. Lactic acid bacteria include, but are not limited to 
spec.es of the genera Lactococcus, Lactobacillus, Leuconostoc, Streptococcus. Pediococcus and 
Enterococcus. Useful unicellular cells are bacterial cells such as gram positive bacteria including 
but not hmrted to, a Bacillus cell, e.g., Bacillus alkalophilus. Bacillus amyloliquefaciens Bacillus 
brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, Bacillus lautus, Bacillus lentus 
Bacillus licheniformis, Bacillus megaterium, Bacillus .stearothermophilus. Bacillus subtilis and 
Bacillus thuringiensis; or a Streptomyces cell. e.g.. Streptomyces IMdans or Streptomyces 
munnus, or gram negative bacteria such as E. coli and Pseudomonas sp. In a preferred 
embodiment, the bacterial host cell is a Bacillus lentus. Bacillus licheniformis Bacillus 
stearothermophilus or Bacillus subtilis cell. In another preferred embodiment, the Bacillus cell is 
an alkalophilic Bacillus. 

The introduction of a vector into a bacterial host cell may, for instance, be effected by 
protoplast transformation (see. e.g., Chang and Cohen. 1979. Molecular General Genetics 168- 
111-115). using competent cells (see. e.g., Young and Spizizin. 1961. Journal of Bacteriology 81: 
823-829, or Dubnau and Davidoff-Abelson, 1971. Journal of Molecular Biology 56: 209-221), 
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electroporation (see, e.g., Shigekawa and Dower. 1988 Biotechniaues 6- iao m . 

1 71) and all mtospnric fungi (Hawkswonh at a/., 1 995. sapra) ™* 
: In anothar particular embodimant. tha fungal hast caf, Is a yaast call W 3, used 

ena,n mcludas ascosporognnous yaas. (Endomycetalas). basidiaspLganous yLTand Zl 
"l F : 9i ""^ ~ «~ - —an a, yeasTm ay * ng^ 

Symposfum Sanaa No. 9. 1980). . ana uavenport, R.R., ads. Soc. App. Bacterid. 

■ -Tha yaast haat calf may . ba a Candfda. Mansam*,. Kluyvtmmyces AM. 
tSacaftaromycaa.Scnftoaacanamniycaa.aryarra.Wacall - . 

nlarnan.? 8 r Sa ' !T ^ ' tten » nto "'»«>»»l -FilanuWnus fungi" induda:' =,l 

~Um* glucan, chtosan. mannan , and other £ 

at: ;;T s :^ on ca,ab * m is ob " 9 ^ -* * ~ ~ £i 

Examplaa cf filamentous fungal hast calls ara cells af spaclaa of but nat limited ,„ 

BscKat and Guaranfa L ,7* 7£ , "*™*-*« «» — - * 

Ma/acu/are/a/oov 1,1, and Simon, M.I.. adilara. Gu«a to YW Ganaffca and 

«a,acu,a r 8 ,a/ogy. Mafnoda ,n Bm ymology , Vol uma ,94. pp ,82-187. Academic Press . ,„ c New 
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York; Ito et a/.. 1983. Journal of Bacteriology 153: 163; and Hinnen et a/.. 1978. Proceedings of 
the National Academy of Sciences USA 75: 1920. 

The invention relates to a recombinant host eel! comprising a polynucleotide of the 
mvenfon, or an expression vector or polynucleotide construct of the invention. In a preferred 
embodiment, the recombinant host cell is a Bac///us cell. 

Plants 

The present invention also relates to a transgenic plant, plant part, or plant cell which has 
; been transformed with a nucleic acid sequence encoding a polypeptide having protease activity of 
the present invention so as to express and produce the polypeptide in recoverable quantities The 
polypeptide may be recovered from the plant or plant part. Alternatively, the plant or plant part 
contammg the recombinant polypeptide may be used as such for improving the quality of a food or 
feed. e.g., .mproving nutritional value, payability, and rheological properties, or to destroy an 
antinutritive factor. 

• In a particular embodiment, the polypeptide is targeted- to the endosperm storage 
vacuoles .n seeds. This can be obtained by synthesizing it as a precursor with a suitable: signal 
pept.de, see Horvath et al in PNAS, Feb. 15. 2000. vol. 97. no. 4, p. 1914-1919. • , , , 

The transgenic plant can be dicotyledonous (a dicot) or monocotyledonous (a monocot) 
or engineered variants thereof. Examples of monocot plants are grasses, such as meadow grass 
(blue grass. Poa). forage grass such as festuca. lolium. temperate grass, such as Agrostis and 
cereals, e.g.. wheat, oats. rye. barley, rice, sorghum, and maize (corn). Examples of dicot plants 
are tobacco, legumes, such as lupins, potato, sugar beet. pea. bean and soybean, and cruciferous 
Plants (family Brassicaceae). such as cauliflower, rape seed, and the closely related model 
orgamsm Arabidopsis thaliana. Low-phytate plants as described e.g. in US patent no 5 689 054 
and US patent no. 6.111.168 are examples of engineered plants. 

Examples of plant parts are stem, callus, leaves, root, fruits, seeds, and tubers. Also 
specfic plant tissues, such as chloroplast, apoplast, mitochondria, vacuole, peroxisomes, and 
cytoplasm are considered to be a plant part. Furthermore, any plant cell, whatever the tissue 
origin, is considered to be a plant part. 

Also included within the scope of the present invention are the progeny of such plants 
plant parts and plant cells. 

The transgenic plant or plant cell expressing a polypeptide of the present invention may 
be constructed in accordance with methods known in the art. Briefly, the plant or plant cell is 
constructed by incorporating one or more expression constructs encoding a polypeptide of the 
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present invention into the plant host genome and propagating the resu.ting modified plant or plant 
cell into a transgenic plant or plant cell. 

Conveniently, the expression construct is a nucleic acid construct which comprises a' 
nuc.e,c ac,d sequence encoding a polypeptide of the present invention operably linked with 
appropnate regulatory sequences required for expression of the nucleic acid sequence in the plant 
or plant part of choice. Furthermore, the expression construct may comprise a selectable marker 
useful for .dentifying host ce..s into which the expression construct has been integrated and DNA 

"Tnll"? 83 ^ intr ° dUCti0n ° f COnStmCt int ° the Plant in < uestion V* 'a«er depends 
; on the DNA introduction method to be used). 

The choice of regulatory sequences, such as promoter and terminator sequences and 
opbonally s,gnal or transit sequences are determined, for example, on the basis of when, where 
and how the polypeptide is desired to be expressed. For instance, the expression of the gene 
encodmg a polypeptide of the present invention may be constitutive or inducible, or may be 
developmental, stage or tissue specific, and the gene product may be targeted to a specific tissue 
or- plant part such as seeds or leaves. Regulatory sequences are. for example; described by 
..•VTagueefa/., 1988. Plant Physiology 86:506. . . r 

Cell 2vZ ~ VeeXpression ' the 35 ^aMV. promoter may be used (Franck "et aA.W 
Cell 21. 285-294). Organ-specific promoters may be. for example, a promoter from storage sink 
hssues such as seeds, potato tubers, and frufe (Edwards & Coruzzi, 1990. Ann. Re, Genet 24- 

Zt^l " metab ° liC tiSSU6S SUCH 35 menStemS (,t ° • 1994 . Biol 24 

fl I"!' r dfiC Pr ° m0ter SUCh 38 ^ 9lUte ' in ' Pr ° ,amjn - 9lobu,Sn ' or aibu ™ P-o^te 
from nee (Wu et a/.. 1998. Plant an* Cell Physiology 39: 885-889). a Vioia feoa promoter from the 

tegurmn , 84 and the unknown seed protein gene from Viola fa ba (Conrad et a/.. 1998. Journal of 

* 7r n Z 9Y 711X 3 Pr ° m0terfr0 - a - bo* protein (Chen eta!.. 1998. Plant 

and cel. Phys,o«ogy 39: 935-941). the storage protein napA promoter from Brassioa napus. or any 
other seed specific promoter known in the art. e.g.. as described in WO 91/14772. Furthermore 
he promoter may be a leaf specific promoter such as the noes promoter from rice or tomato' 
(Kyozuka et a/.. 1993. Plant Physiology 102: 991-1000. the chlorella virus adenine 
methyltransferase gene promoter (Mitra and Higgins. 1994. Plant Molecular Biology 26: 85-93) or 
the a/oP gene promoter from rice (Kagaya et a/.. 1995. Molecular and General Genetics 248- 668- 
674). or a wound inducible promoter such as the potato pin2 promoter (Xu et el., 1993. Plant 
Molecular Biology 22: 573-588). 

A promoter enhancer element may a.so be used to achieve higher expression of the 

™ Z T* FM ' nS,anCe ' Pr0m °' er enhanCer e,emert ™* ta - W™- -*ich is 
Placed between the promoter and the nucleotide sequence encoding a polypeptide of the present 
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iTetir " ,StanCe ' X " * 1 " 3 ' ^ **" UM - the « * •» *» a* 1 
gene to enhance expression. 

Still furmer ft. codon usage may be optimized for the plan, species in question to 
improve expression (see Hoivath et al referred to above). 

The selectable martcer gene and any other parts of the expression construct may be 
chosen from those available in the art. 

The nucleic acid construct is Incorporated into the plant genome accord™ to 

• Z^tTT ^ tranJoZ: 11° 

. med atod transformation, micrainjecdon. particle bombammem. bio«stic ,ranstom,a,ion and 
etocboporabon (Gasser a, af. ,990. Science 244: ,293; Poby k us. ,990. 535 
Shimamotoe/al.. ,989,/vafura338: 274). 'ecnnorogy a. 535, 

o— H P T Sn% ' ^ 9ro " aC ' ert '' m <™^«*»»«ediated gene .ransfer is the method o, choice tor 
,17 « ' ^ a ' S0 be ,W •""•A""** ™noco te . altoough other 

for generabng bansgen,c monoco.s-.is particte bombardment (micraacopic gold or.,„ngs,en 
1Tp,T T ** tranS ' 0m,in9 DNA> °* — <<-eloping.eLyos (CnnC 

moll Tt *"**'*«' 10: 667 " 674) - ^ abernabve method tor transformation o 

ZZTl °" Pm,0P,aS ' ""*""*» - »y Omirulieh e, a,.. ,993 P ,an, 

Molecular Biology 21: 415-428. ' aw 

construe™ 0 ""? ,ranS,0 ' Tna " 0n ' *•"*"»"- having incorporated .herein the expression 
construe, are selected and rege„ara.ed Into whole p,a«s acceding to methods we»^wn in 2 

The present invention also relates to methods for producing a polypeptide of the present 
nvenfcon compfcing (a, cu«va«ng a .ransgenic plan, or a p,an. cel, combing a £ZZ 
~quence encoding a polypepbde having pmtease activity o, me presen. Lenbon undt 

Zn 7, T 6 ProdUC *° n * "* (b) ** POl^epBde. Tlve 

mv^on reiates to a bansgenic plant or plan, part, comprtsing a polynucleotide » defined * 
clam, 8. or an expression vector or polynucleotide construe, of ,he invention 



Animals 



e, e . T T* ' nUan,,0n ^ '° 3 " anS9eniC - n ° n - hUma " a " lmal *"« P<°ducto or 

elemems .heraof. examp.es o, which are body fluids such as mi* and blood, organs gash an" 

anmta, cells. Techniques for expressing proteins, e.g. in mammalian ce,,s. ara k own n toe art 
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see e* the handbook Protein Expression: A Practical Approach, Higgins and Hemes (eds) 
Oxford University Press (1999), and the three other handbooks in this series relating to Gene 
Transition. RNA processing, and Post-translation al Processing. Generally speaking, to prepare 
a transgenic animal, setected cells of a selected animal are transfonned with a nucleic acid 
sequence encoding a polypeptide having protease activity of the present invention so as to 
express and produce the polypeptide. The polypeptide may be recovered from the animal eg 
from the milk of female animals, or the polypeptide may be expressed to the benefit of the animal 
-tself, e^g. to assist the animal's digestion. Examples of animals are mentioned below in the 
; section headed Animal Feed. 

To produce a transgenic animal with a view to recovering the protease from the milk of the 
ammal, a gene encoding the protease may be inserted into the ferti»ized eggs of an animal in 
quest.cn, e.g. by use of a transgene expression vector which comprises a suitable milk protein 
promoter, and the gene encoding the protease. The transgene expression vector is microinjected 
into fenced eggs, and preferably permanently integrated into the chromosome. Once the egg 
begins to grow and divide, the potential embryo-is indented into a surrogate mother, and animals 
•carrying thelransgene are identified. The resulting. anima, can then be multiplied by conventionaf • 

M^rt XPePtide ^ PUrified fr ° m the anima, ' S See Meade.,H.M. et a. 

(1999): Expression of recombinant proteins in the milk of transgenic animals. Gene expression 
systems: Us,ng nature for the art of expression. J. M. Fernandez and J. P. Hoeffler (eds) 
Academic Press. 

In the alternative, in order to produce a transgenic non-human animal that carries in the 
genome of its somatic and/or germ cells a nucleic acid sequence including a heterologous 
ransgene construct including a transgene encoding the protease, the transgene may be operably 
hnked to a first regulatory sequence for salivary g.and specific expression of the protease as 
disclosed in WO 2000064247. 

The invention relates to a transgenic, non-human animal, or products, or elements thereof 
comprise a polynucleotide, or an expression vector or po.ynucleotide construct of the invention. ' 

Methods of Production 

The present invention also relates to methods for producing a polypeptide of the present 
mvention comprising (a) cultivating a host ce„ or a transgenic plant or anima, under conditions 
conducive for production of the polypeptide in a supernatant; and optionally (b) recovering the 
polypeptide. 9 

In the production « ^ ^ ^ ^ ^ ^ > 

med,um su,«able for production of the peptide usi „ 9 metnods ^ |n „, ^ F „, example 
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the cell may be cultivated by shake flask cultivation, small-scale or large-scale fermentation 
(including continuous, batch, fed-batch, or solid state fermentations) in laboratory or industrial 
fermentors performed in a suitable medium and under conditions allowing the polypeptide to be 
expressed and/or isolated. The cultivation takes place in a suitable nutrient medium comprising 
carbon and nitrogen sources and inorganic salts, using procedures known in the art. Suitable 
media are available from commercial suppliers or may be prepared according to published 
compositions (e.g., in catalogues of the American Type Culture Collection). If the polypeptide is 
secreted into the nutrient medium, the polypeptide can be recovered directly from the medium. If 
; the polypeptide is not secreted, it can be recovered from cell lysates. 

The polypeptides may be detected using methods known in the art that are specific for the 
polypeptides. These detection methods may include use of specific antibodies, formation of a 
product, or disappearance of a substrate. For example, a protease assay may be used to 
determine the activity of the polypeptide as described herein. 

The resulting polypeptide may be recovered by methods known in the art. For example, 
the polypeptide may be recovered from the nutrient medium by conventional procedures including! 
but notlimited to.:centrifugation. filtration, extraction; spray-drying, evaporation, or precipitation.- 

The. polypeptides- of the present- invention may be purified by a variety! of. procedures 
known in the art including, but not limited to. chromatography (e.g.. ion exchange, affinity, 
hydrophobic, chromatofocusing, and size exclusion), electrophoretic procedures (e.g.. preparative 
isoelectric focusing), differential solubility (e.g.. ammonium sulfate precipitation). SDS-PAGE. or 
extraction (see. e.g.. Protein Purification, J.-C. Janson and Lars Ryden. editors, VCH Publishers 
New York, 1989). 

Compositions 

In a still further aspect, the present invention relates to compositions comprising a 
polypeptide of the present invention. The polypeptide compositions may be prepared in 
accordance with methods known in the art and may be in the form of a liquid or a dry composition. 
For instance, the polypeptide composition may be in the form of a granulate or a microgranulate. 
The polypeptide to be included in the composition may be stabilized in accordance with methods 
known in the art. Examples are given below of preferred uses of the polypeptides or polypeptide 
compositions of the invention. 

Animal Feed 

The present invention is also directed to methods for using the polypeptides of the 
invention in animal feed, as well as to feed compositions and feed additives comprising the 
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polypeptides of the invention. The term animal includes all animals, including human beings 
Examples of animals are non-ruminants, and ruminants, such as cows, sheep and horses In a 
part.cular embod,ment, the animal is a non-ruminant animal. Non-ruminant animals include mono- 
gastnc animals, e.g. pigs or swine (including, but not limited to, piglets, growing pigs, and sows)- 
poultry such as turkeys, ducks and chicken (including but not limited to broiler chicks layers)' 
young calves; and fish (including but not limited to salmon, trout, tilapia, catfish and carps- and 
crustaceans (including but not limited to shrimps and prawns) 

The term feed or feed composition means any compound, preparation, mixture or 
; composition suitable for, or intended for intake by an animal. 

In the use according to the invention the protease can be fed to the animal before, after or 
simultaneously with the diet. The latter is preferred. 

In a particular embodiment, the protease, in the form in which it is added to the feed or 
when being included in a feed additive, is well-defined. Well-defined means that the protease 
preparation is at least 50% pure as determined by Size-exclusion chromatography (see Example 
12 of WO 01/58275). In other particular embodiments the protease preparation is at least 60 70 
80, 85; 88, 90, 92. 94, or at Ieasf95% pure as determined by this method. A well-defined, protease 
preparation, is .advantageous. For .instance. Jt is much easier to dose correctly to. the feed a- 
protease that is essentially free from interfering or contaminating other proteases. The term dose 
correcHy refers in.particular to the objective of obtaining consistent and constant results, and the 
capability of optimising dosage based upon the desired effect. 

For the use in animal feed, however, the protease need not be that pure; it may eg 
include other enzymes, in which case it could be termed a protease preparation. The protease 
preparation can be (a) added directly to the feed (or used directly in a treatment process of 
vegetable proteins), or (b) it can be used in the production of one or more intermediate 
compositions such as feed additives or premixes that is subsequently added tc the feed (or used 
.n a treatment process). The degree of purity described above refers to the purity of the original 
protease preparation, whether used according to (a) or (b) above. 

Protease preparations with purities of this order of magnitude are in particular obtainable 
usmg recombinant methods of production, whereas they are not. so easily obtained and also 
subject to a much higher batch-to-batch variation when the protease is produced by traditional 
fermentation methods. Such protease preparation may of course be mixed with other enzymes 

In a particular embodiment, the protease for use according to the invention is capable of 
*>.ubilising vegetable proteins. A suitable assay for determining solubilised protein is disclosed in 
Example 11. 
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oreoara^n """^ " * a " y Mmp01md ' 

ZTLT£ " Pr0teinS ^ ~r ambodimente, the 

proton content of the vegetable proteins is a. teas. 10. 20. 30. 40. 50. or 60% (wA»). Vegetable 
protons ma, be derived from vegetabte pro.e,n souroes. suob as legumes and oereate for 

CbenopooVaceae. and Poaceae. suob as soy bean meal, lupin mea, and rapeseed mea,. In a 
palter embodiment. the ve 9 e.abte protein souroe is materia, from one or more pianfs of me 
family Pabaceae. e g. soybean, lupine, pea. or bean. 

m«™ '," emb ° diment "» ^etable protein souroe is material from one or 

eTalesT , ^X^* ^' ^ ^r o, guinea. Outer 

llbte r, 7 P,<>tein ^ are raPMeed ' ^ - b, "«' J * a P«'a™d 

vegetable pmton source. Ofber examptes of vegetable protein spumes ere cereals such as 
barley, wheal rye. oat. maize (corn), rice, end sorghum. 

■> • The Ireatntent according to the invention of vegetable protons w*h at leas, one protease of 
the rnvention results.* an .increased -sblubi„sa.ion.of,vegelable proteins. .The . Mowing are- 
■ examptes * * solubilised ^oton obtainable using ,he*roteases o, the invention in a monoga^ 
«-o model: At eas, ,02%. ,03%. ,04%. ,05%. ,06%. or a, tees, 107%. relative ,o a blank 

^jrrtr soiubi " sea ^ * de,em * nad ^ *• » 

TJ2 T T ^ *° * * •»*-» Ton, other components of 

the usually complex natural compositions such as feed. 

ce„h, ' n ,'• PartiC '" ar ""^a'- «» >«*— *» according to the invention is' 

capabte of increasing ,he amount of digeaabie vegetable proteins. The following are examp.es of 

I !! - 9 Pr< " ein 0b,ainab ' a USin9 »" Pr0,eases * "» '"-anhan in a monogastnc 
. w^o model: At teas, ,04%. ,05%. ,06%. ,07%. ,05%. ,00%. or a. leas, ,10%. relaai ,o a 

Ex^i Percenta9e °' < " 9eSted " di9eSBb ' e iS determine ° usl "a ^ m ««, mode, of 

The following ere. examples of % digested or digestible protein obtainable using the 
proteases of the inventon in en aquaculture to vlto, model: At least ,03%. 104% 105% 106% 
107%. ,06%. ,00% or a, leas, „0%. reiabve to a W an, The peroentege of digested or digesltt 
protein is determined using the aquaculture to v*n> model of Example ,2 

in a m further particular embodiment, me pmtease ,or use accoming to the invenBon is 
capable of increasing me Degree of Hydrolysis (OH, o, vege.ab,e proteins. The fo.low.no a re 
examples of Degree of Hydrolysis increase obteinable in a monogastrio to vltn, mode, A. teas, 
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£*» -ease ob,a.ab,e ln . iX." 

ouaj" "aT" emb0dimen ' °' 8 ,^ea,men, Pr ° CeSS ° ftha Posers) in 
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. : rsrr To aohfeve ^ ,ha ~ - — ^ - 

a«v*es. The annate reacbon , s UMl| ^ ^ ^ 

' "aeon an ° ,her j Part ' CUlar ^bodiman, of a .reatmen, process of ,ha invention. L protaasa 

whatever ofhar means eou,d hava bean appBed to postpone .he acfjon of .ha enzyme 

for use L" ZZT* ~ fe ' Pra ~' " *- ' ~ P— 

^ T" y "' S ' inC,eaSed Pmteln desrad * n protein 

^ZZZTZT^ITT in ~ and ,ha a * mal parf ~ 

faadraia.eto.^ht^of^r^r:^™" ^ « 

4% 5^^^°*™' ,aed CMWerel0n ^ " *»— « * - ^ 1*. 3% 
«*/o t t>/o f 6%, 7%, 8%, 9% or at Iasc* -mo/ i_ *. ^ ' 

inoraased by at leas. 2% 3V t I ""^ Pa '" CUlar emb o a ™^ •» «<*** 9ain is 

(1966) - D,raCaVe * ' a C ° mml - to " * • «• 1386 nxan, ,a methode de oa teu , de ,a vaieur 
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^rgetique des aliments composes destines a la volaille. Journal Officiel des Communautes 
Europeennes, L130, 53 - 54. 

The protease can be added to the feed in any form, be it as a relatively pure protease or in 
adm.xture with other components intended for addition to anima. feed, i.e. in the form of animal 
feed add.tives, such as the so-called pre-mixes for animal feed. 

In a further aspect the present invention relates to compositions for use in animal feed 
such as animal feed, and animal feed additives, e.g. premixes. 

Apart from the protease of the invention, the animal feed additives of the invention contain 
; at least one fat-soluble vitamin, and/or at least one water soluble vitamin, and/or at least one trace 
mineral. The feed additive may also contain at least one macro mineral. 

Further, optional, feed-additive ingredients are colouring agents, aroma compounds 
stab.l,sers, antimicrobial peptides, including antifungal polypeptides, and/or at least one other 
enzyme selected from amongst phytase (EC 3.1.3.8 or 3.1.3.26); xylanase (EC 3 218V 
ga.actanase (EC 3.2.1.89); alpha-galactosidase (EC 3.2.1.22); protease (EC 34--)' 
Phospholipase A1 (EC 3.1.1.32); phospholipase A2 (EC 3.1.1.4); lyso P hospho.ipase(EC 3 1 1 5)' 
phospholipase C (3.1.4.3); phospholipase .D (EC 3.1. 4.4), and/or beta-g.ucanase (EC 3.2.1.4 or. 
• EC. 3.2. 1.6).. . * s ,. . 

In a particular embodiment these other enzymes are welWefined (as defined above for 
protease preparations). 

Examples of antimicrobial peptides (AMP's) are CAP 18, Leucocin A. Tritrpticin, Protegrin- 
1 Thanatm. Defensin, Lactoferrin. Lactoferricin. and Ovispirin such as Novispirin (Robert Lehrer 
2000X Plectasins, and Statins, including the compounds and po.ypeptides disclosed in 
PCT/DK02/00781 and PCT/DK02/0081 2, as we., as variants or fragments of the above that retain 
antimicrobial activity. 

Examples of antifungal polypeptides (AFP's) are the Aspergillus giganteus. and Aspergillus 
nrger peptides, as well as variants and fragments thereof which retain antifungal activity as 
disclosed in WO 94/01459 and WO 02/090384. 

Usal.y fat- and water-soluble vitamins, as well as trace minerals form part of a so-called 
premix intended for addition to the feed, whereas macro minerals are usually separately added to 
the feed. A premix enriched with a protease of the invention, is an example of an anima. feed 
additive of the invention. 

In a particular embodiment, the animal feed additive of the invention is intended for being 
-ncluded (or prescribed as having to be included) in animal diets or feed at levels of 0 01 to 1 0 0%- 
more particularly 0.05 to 5.0 %; or 0.2 to 1.0% (% meaning g additive per 100 g feed). This is so in 
particular for premixes. 
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The following are non-exclusive lists of examples of these components: 

Examples of fat-soluble vitamins are vitamin A, vitamin D3. vitamin E, and vitamin K ea 
vitamin K3. ' * a ' 

Examples of water-soluble vitamins are vitamin B12. biotin and choline, vitamin B1, vitamin 
B2, vitamm B6, niacin, folic acid and pantothenate, e.g. Ca-D-panthothenate. 

Examples of trace minerals are manganese, zinc. iron, copper, iodine, selenium, and 

cobalt 

Examples of macro minerals are calcium, phosphorus and sodium. 
The nutritional requirements of these components (exemplified with poultry and 
p,g.ets/pig S ) are listed in Table A of WO 01/58275. Nutritional requirement means that these 
components should be provided in the diet in the concentrations indicated. 

In the alternative, the animal feed additive of the invention comprises at least one of the 
■ndMdual components specified in Table A of WO 01/58275. At least one means either of one or 
more of. one. or two, or three, or four and so forth up to all thirteen, or up to all fifteen individual 
-components. More specifically, this at least one individual component is included in the additive of 
...the mvention in such an amount as to provide.an in-feed-concentration within the.range indicated 
in column four, or column five, or column six of Table A. 

The present invention also relates to animal feed compositions. Animal feed compositions 
or d,ets have a relatively high content of protein. Poultry and pig diets can be characterised as 
-ndicated in Table B of WO 01/58275, columns 2-3. Fish diets can be characterised as indicated in 
column 4 of this Table B. Furthermore such fish diets usually have a crude fat content of 200-310 
g/kg. WO 01/58275 corresponds to US 09/779334 which is hereby incorporated by reference 

An an.mal feed composition according to the invention has a crude protein content of 50- 
800 g/kg, and furthermore comprises at least one protease as claimed herein. 

Furthermore, or in the alternative (to the crude protein content indicated above), the animal 
feed compos.tion of the invention has a content of metabolisable energy of 10-30 MJ/kg; and/or a 
content of calcium of 0.1-200 g/kg; and/or a content of available phosphorus of 0 1-200 g/kg- 
and/or a content of methionine of 0.1-100 g/kg; and/or a content of methionine plus cysteine of 
0.1-1 50 g/kg; and/or a content of lysine of 0.5-50 g/kg. 

In particular embodiments, the content of metabolisable energy, crude protein, calcium 
phosphorus, methionine, methionine p,us cysteine, and/or lysine is within any one of ranges 2 3 
4 or 5 in Table B of WO 01/58275 (R. 2-5). 

Crude protein is calculated as nitrogen (N) multiplied by a factor 6.25, i.e. Crude protein 
(g/kg)= N (g/kg) x 6.25. The nitrogen content is determined by the Kjeldahl method (A.OAC, 
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1984. Official Methods of Analysis 14th *»ri a« b ^ 

Washington DC). ' * l80c,abon of Official Analytical Chemists, 

Metabolisable energy can be calculated on the basis of th» MPn \ r 

In ^ ninth revised edltion . ^i^r;™ 6 "' 

5 animal nutrition board nf o^rj^.u. numtion, committee on 

Washington. D.C T« andTr ^ ^ » 

t^e^ f "t," — ,997 ' ™* over 

0-30% oa«s;and/or SoT^n T ' M ° % ^ * 70 * Baney,and/or 

. and/orCW0% soybean meat and/or.0-10%fish meal: and/orO-20% who. 

rvp^r s. ^.r^rr ,eed (non - ~- 

» minerals are edded according ,h ^ ' am ° Unte * *»*• «*< 

addad as so»d ^^11™ !TT" * "* ^ ~' *"»"»• * 
addad bafora or dunno m F ° r —I* • «* «»» formulation is M 

^ ^ 6 inc "" ora,ed « <*«" additiva o, pramix 

Tne Bnal enayme eoncentraUon in the die. ia .**, tha range of 0 01 200 m „ 
• P^r, per Kg die,, for example ,„ *. range o, 0.»S mg en*yme pro^^JT^ 

compos as soon is ^IST^T.^ ~ T "* ^ 
defames. ,a dosage in mg enzyme prol per Kg JT^"" °' — *" 
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The same principles apply for determining mg enzyme protein in feed additives. Of course 
,f a sampte ,s avaiiabie pf the protease used for preparing the feed additive or the feed the 

rriTT d :r ned from tws < n ° need to ■»■* *• ~ — - - 

composition or the additive). 

The present invention is further described by the following examples which should not be 
construed as-limiting the scope of the invention. 

Detergent P.omDosirintvt 

.- The protease of the invention may be added to and thus become a component of a 

detergent composition. The detergent composition of the invent may ,or exampie be formulated 
as a hand or machine laundry detergent composition Including a iaundry additive composition 
suttable .for pre-treatmen, of stained fabrics and a rinse added fabric softener composition or be 
formu ated as a detergent composition for use in genera, household hard surface Cea^ 
operabons. or be formulated for hand or machine dishwashing operefions 

» . in a spedfio aspect, the invention provides a-detergent addifive comprising the protease of 
«te nvenbon. The detergentiaddiuve as we,, as me' detergent compost ma, compose : one o, 
more otner.enrymes such as another protease. such as aikaline proteases from Bacillus, a lipase; 
a afirnase. an amy,ase. a cerbohydrase. a ce,,u,ase. a pecfinase. a mannanase. an arabinase a 
galactanase. a xyianase. an oxidase. e.g.. a I accasa. and/or a peroxidase 

detent 9 r ra u he PrOPertfeS Ch ° S *" enZyme<5) S> ™*> te ■» the selected 

eteH^ , h ' mUm ' C ° mPa * bilHy ^ en2) " na,iC and "•—■a™* ingredients, 

etc.). and the enzyme(s) should be present In effective amounts 

enoin SU " aDle ' iPaSeS inClUdS U,0Se " <* °"9ln. Chem,ca,,y modified or protein 

are ,ncluded - Examftes °* useft " ,,paaas Muda *— *™ «^** 

trTJ£,7 m "* eSl e ^ * ' a " U9 " ,OSa (r as described in EP 253068 

and EP 3052,6 or from H.Jnsotons as described in WO 06/13580. a Pseuddmonas lipase, e g 

GB 7£ZTp P ' Pae "* afca * anaa < EP 21 °" 2 >- " ^Pacfa (EP 331376,. P. rfrjf 
l^onl T ^ PSeUtf0m0naS * **> » 705 (WO 95*6720 and WO 

(1^31 B h T"""""* *» 96/12012 '' a *— *-* e.9. torn a auhffl/s (Dariois e, a,. 
(1993). aochem,oa e, Btophysica Acta. 1131, 253-360). a steameWcp/rte (JP 64/744992) or 
a pomr/os (WO 91/16422). Other examp,ea are fipase variants such as those described in WO 
92*5249. WO 94/01541. EP 407225. EP 260,05. WO 95,35381. WO 96,00292 WO 95^7« 
WO 94*5578. WO 95,,4763. WO 95.26,5. WO 97,04079 and WO 97*7202 P^ 
commeraa„y ava„ab,e „pase enzymes include Upo,asa™and Lipolase Ultra™ (Novoiymas A*) 
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Suitable amylases (alpha- and/or beta-) include those of bacterial or fungal origin 
Chemically modified or protein engineered mutants are included. Amylases include, for example 
a.pha-amy.ases obtained from BaoiUus, e.g. a special strain of 8. iicheniformis, described in more 
deta.l ,n GB 1 ,296,839. Examples of useful amylases are the variants described in WO 94/02597 
WO 94/18314, WO 96/23873. and WO 97/43424, especially the variants with substitutions in one 
or more of the following positions: 15. 23. 105, 106. 124, 128. 133. 154. 156, 181 188 190 197 
202. 20* 209, 243. 264. 304. 305. 391. 408. and 444. CommerciaHy avai.ab«e amylases are 
Duramyl , Termamyl™, Fungamyl™ and BAN- (Novozymes A/S), Rapidase™ and Purastar™ 
; (from Genencor International Inc.). 

Suitable cellulases include those of bacterial or fungal origin. Chemically modified or 
prote.n engineered mutants are included. Suitable cellulases include cel.ulases from the genera 
Bad//*, Pseudomonas, HumicCa, Fusanum, Tn/e/awa, Acremonium, e.g. the funga. ceHulases 
produced from Humicota insotens. MyceHophthora thermopMa and Fusanum oxysporum 
d,sclosed in US 4,435,307, US 5.648.263. US 5.691.178. US 5.776.757 and WO 89/09259 
Especially suitab.e ceHulases are the -alkaline or neutral cellulases having colour care benefits'- 
,-Examples of such cellulases are cellulases described in EP 0 495257, EP 531372 WO 96/1 1262 
• WO 96/29397, WO 98/08940. Other- examples are cel.u.ase variants such as tho'se described in 
WO 94/07998. EP 0 531 315. US 5.457.046. US 5.686.593. US 5.763.254. WO 95/24471 WO 
98/12307 and WO 99/01544. Commercially available cellulases include Cel.uzyme™ ' and 

^rTnLl N0V ° 2ymeS ^ C,a2 ' naSe ™' ^ Pur3daX HA ™ (^encor .nternational Inci. and 
KAC-500(B)™(Kao Corporation). 

Suitable peroxidases/oxidases include those of plant, bacterial or fungal origin. ChemicaLy 
modrfied or protein engineered mutants are included. Examples of usefu. peroxidases include 
perox,dases from Coprinus, e.g. from C. cinersus, and variants thereof as those described in WO 
93/24618. WO 95/10602. and WO 98/15257. Commercially available peroxidases include 
Guardzyme™ (Novozymes). 

The detergent enzyme(s) may be included in a detergent composition by adding separate 
add,t,ves containing one or more enzymes, or by adding a combined additive comprising all of 
these enzymes. A detergent additive of the invention, i.e. a separate additive or a combined 
add,t,ve, can be formulated e.g. as a granulate, a liquid, a slurry, etc. Preferred detergent additive 
ormulauons are granulates, in particular non-dusting, granulates. Hquids, in particular stabilized 
liquids, or slurries. 

Non-dusting granulates may be produced, e.g.. as disclosed in US 4,106.991 and 
4,661.452 and may optionally be coated by methods known in the art. Examples of waxy coating 
materials are polyethylene oxide) products (po.yethy.eneg.yco.. PEG) with mean molar weights of 
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1000 to 20000; ethoxylated nonylphenofs having from 16 to 50 ethylene oxide units; ethoxylated 
fatty alcohols in which the alcohol contains from 12 to 20 carbon atoms and in which there are 15 
to 80 ethylene oxide units; fatty alcohols; fatty acids; and mono- and di- and triglycerides of fatty 
ac.ds. Examples of filnvforming coating materials suitable for application by fluid bed techniques 
are given in GB- 1483591. Liquid enzyme preparations may, for instance, be stabilized by adding a 
polyol such as propylene glycol, a sugar or sugar alcohol, lactic acid or boric acid according to 
established methods. Protected enzymes may be prepared according to the method disclosed in 
EP 23821 6. 

The detergent composition of the invention may be in any convenient form, e.g.. a bar a 
tablet, a powder, a granule, a paste or a liquid. A liquid detergent may be aqueous, typically 
containing up to 70 % water and 0-30 % organic solvent, or non-aqueous. 

The detergent composition comprises one or more surfactants, which may be non-ionic 
including semi-polar and/or anionic and/or cationic and/or zwitterionic. The surfactants are 
typically present at a level of from 0.1% to 60% by weight. 

> ■ When included therein the detergent will usually contain from about 1 % to about 40%_of an 
an.on.c surfactant such as linear alkylbenzenesulfonate. alpha-olefinsulfonate, *, ky , sulfate {fatty 
Alcohol sulfate), alcohol ethoxysulfate.. secondary alkanesulfonate, alpha-sulfo -fatty acid methyl 
ester, alkyl- or alkenylsucdnic acid or soap. 

When included therein the detergent will usually contain from about 0.2% to about 40% of 
a non-.on,c surfactant such as alcohol ethoxylate, nonylphenol ethoxylate. alkylpolyglycoside 
alkyld,methylamineoxide, ethoxylated fatty acid monoethanolamide. fatty acid monoethanolamide 
polyhydroxy alkyl fatty acid amide, or N-acyl N-alkyl derivatives of glucosamine ("glucamides") 

The detergent may contain 0-65 % of a detergent builder or complexing agent such as 
zeolite, diphosphate, triphosphate, phosphonate. carbonate, citrate, nitrilotriacetic acid 
ethylenediaminetetraacetic acid, diethylenetriaminepentaacetic add, alkyl- or alkenylsucdnic acid' 
soluble silicates or layered silicates (e.g. SKS-6 from Hoechst). 

The detergent may comprise one or more polymers. Examples are 
carboxymethylcellulose, poly(vinylpyrro.idone), poly (ethylene glycol), polyvinyl a.cohol) 
poly(vinylpyridine-N-oxide), polyfvinylimidazole), polycarboxylates such as polyacrylates 
malerc/acrylic acid copolymers and lauryl methacrylate/acrylic acid copolymers. 

The detergent may contain a bleaching system which may comprise a H 2 O z source such 
as perborate or percarbonate which may be combined with a peradd-forming bleach activator 
such as tetraacetylethylenediamine or nonanoyloxybenzenesuJfonate. Alternative^, the bleaching 
system may comprise peroxyadds of e.g. the amide, imide, or sulfone type 
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The enzyme(s) of the detergent composition of the invention may be stabilized using 
conventional stabilizing agents, e.g., .a polyol such as propylene glycol or glycerol, a sugar or 
sugar alcohol, lactic acid, boric acid, or a boric acid derivative, e.g.. an aromatic borate ester, or a 
phenyl boronic acid derivative such as 4-formylphenyl boronic acid, and the composition may be 
formulated as described in e.g. WO 92/1 9709 and WO 92/1 9708. 

The detergent may also contain other conventional detergent ingredients such as e.g. 
fabric conditioners including clays, foam boosters, suds suppressors, anti-corrosion agents, soil- 
suspending agents, anti-soil redeposition agents, dyes, bactericides, optical brighteners. 
; hydrotropes. tarnish inhibitors, or perfumes. 

It is at present contemplated that in the detergent compositions any enzyme, in particular 
the enzyme of the invention, may be added in an amount corresponding to 0.01-100 mg of 
enzyme protein per liter of wash liqour, preferably 0.05-5 mg of enzyme protein per liter of wash 
liqour, in particular 0. 1 -1 mg of enzyme protein per liter of wash liqour. 

The enzyme of the invention may additionally be incorporated in the detergent formulations 
•disclosed in WO 97/07202. 

•• . • The. invention described and claimed herein is not .to be limited in scope by .the-specific 
embodiments herein disclosed, .since these embodiments are intended as illustrations of several 
aspects of the invention. Any equivalent embodiments are intended to be within the scope of this 
invention. Indeed, various modifications of the invention in addition to those shown and described 
herein will become apparent to those skilled in the art from the foregoing description. Such 
modifications are also intended to fall within the scope of the appended claims. In the case of 
conflict, the present disclosure including definitions will control. 

Various references are cited herein, the disclosures of which are incorporated by reference 
in their entireties. 

EXAMPLES 
Materials and methods 

Strains: 

Bacillus subtilis PL1801 (Diderichsen. B et al. 1990. Cloning of aldB, which encodes 

alpha-acetolactate decarboxylase, an exoenzyme from Bacillus brevis. J. Bacteriol 

172.4315-4321) 

Bacillus subtilis MB1053 

Bacillus subtilis PL3598-37 

Bacillus subtilis MB1510 
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BaciUus subtilis PL2306. This strain is the B.subtilis DN1885 with disrupted aprand npr 
genes (Diderichsen, B., Wedsted. U., Hedegaard. L, Jensen, B. R., Sjoholm. C. (1990) 
Clonrng of aldB. which encodes alpha-acetolactate decarboxylase, an exoenzyme from 
BacUus brevis. J. Bacterid., 172, 4315-4321) which is also disrupted in the 
transcriptional unit of the known BaciHus subtilis cel.ulase gene, resulting in cel.ulase 
negative cells. The disruption was performed essentially as described in (Eds A L 
Sonenshein, J.A. Hoch and Richard Losick (1993) Bacillus subtilis and other Gram- 
Positive Bacteria, American Society for microbiology, p.618). 

10 Procedure for isolating genomic DMA 

Harvest 1 .5 ml culture and resuspend in 100 pi TEL. Leave at 37C for 30 min 
Add 500 pi thiocynate buffer and leave at room temperature for 1 0 min. 

Add 250 pi NH4Ac and leave at ice for 10 min. . . 
Add 500 pi CIA and mix. 

Transfer to a microcentrifuge and spin for 1 0 min. at full speed • - • 

Transfer supernatant to a newEppendorf tube and add 0.54 volume co.d isopropanol. Mix - - * * 

thoroughly. ~ „ , . 

Spin and wash the DNA pellet with 70 % EtOH. 
Resuspend the genomic DNA in 100 pi TER. 
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TC: 10 mM Tris-HCI, pH 7.4 

1 mM EDTA. pH 8.0 
TEL: 50 mg/ml Lysozym in TE-buffer 

Thiocyanate: 5M guanidium thiocyanate 

100 mM EDTA 

0.6 % w/v N-Iaurylsarcosine, sodium salt. 
60 g thiocyanate, 20 ml 0.5 M EDTA, pH 8.0. 20 ml H20 
dissolves at 65C. Cool down to RT and add 0.6 g IM- 
laurylsarcosine. Add H20 to 100 ml and filter it through a 0.2 p 
sterile filter. 

NH4Ac: 7.5 M CH3COONH4 

TER: 1 Mg/mlRnaseAinTE-buffer 

CIA: ChloroformAisoamyl alcohol 24:1 



Purification of PCR bands a r» d DNA seoi<ftnri n n 
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PCR fragment can be purified using GFX™ PCR DNA and Gel Band™ Purification Kit 
(Pharmacia Biotech) according to the manufacturer's instructions. The nucleotide sequences of 
the amplified PCR fragments are determined on an ABI PRISM™ 3700 DNA Analyzer (PerWn 
Elmer, USA) using 50-100 ng as template, the Taq deoxy-terminal cycle sequencing kit (Perkin 
Elmer. USA), fluorescent labeled terminators and 5 pmol of the sequencing primer of choice. 

Media 

TY: (As described in Ausubel. F. M. et al. (eds.) "Current protocols in Molecular Biology". John 
; Wiley and Sons, 1 995). 

LB agar (As described in Ausubel. F. M. et al. (eds.) "Current protocols in Molecular Biology". 
John Wiley and Sons, 1995). 

LB-PG agar is LB agar supplemented with 0.5% Glucose and 0.05 M potassium phosphate! pH 
7.0. 



Proteolytic activity - • ; • 

v S2A protease activity .is measured using .the PNA assay with succinyl-alanine-alanine? 

proline-phenylalnine-paranitroanilide as a substrate unless otherwise mention. The principle of the 
PNA assay is described in Rothgeb, T.M.. Goodlander. B.D.. Garrison. P.H.. and Smith, LA, 
Journal of the American Oil Chemists' Society. Vol. 65 (5) pp. 806-810 (1988). 

Gene expr ession in Bacillus subtifis host 

All the expressed genes in the following examples are integrated by homologous 
recombination on the Bacillus subMs host cell genome. The genes are expressed under the 
control of a triple promoter system (as described in WO 99/43835). consisting of the promoters 
from Bacillus licheniformis alpha-amylase gene (amyL), Bacillus amyloliquefaciens alpha-amylase 
gene (amyQ), and the Bacillus thuringiensis crylllA promoter including stabilizing sequence. The 
gene coding for Chloramphenicol acetyl-transferase was used as maker. (Described in eg. 
Diderichsen.B.; Poulsen.G.B.; Joergensen.S.T.; A . useful cloning vector for Bacillus subtilis. 
Plasmid 30:312 (1993)). 

Example 1. Construction of synthetic 10R tail-variant genes with Savinase signal 

A synthetic 1 0R gene (1 0RS) encoding a S2A protease denoted 1 0R from Nocardiopsis 
sp. NRRL 18262 having the amino acid sequence shown in SEQ ID NO: 43 (WO 01/58276) was 
constructed, which has the nucleotide sequence shown in SEQ ID NO: 1. This synthetic gene was 
fused by PCR in frame to the DNA coding for the signal peptide from SAVINASE™ (Novozymes) 
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resulting ,n the coding sequence S av -10RS which is shown in SEQ ID NO: 2. Several tail-variants 
of th.s construct were made. Compared to the Sav-10RS protease encoded by SEQ ID NO-2 the 
ta.l variant construct Sav-10RS HVO was constructed to have 8 amino acids extra in the C- 
terminus: QSHVQSAP (SEQ ID NO: 3) which were encoded by the fo.lowing DMA sequence 
5 extension inserted in front of the TAA stopcodon of SEQ ID NO: 2: 
(SEQ ID NO: 4): caatcgcatgttcaatccgctcca 

Tail variant Sav-10RS HV1 was constructed to have 4 amino acids extra in the C- 

, m IrT* QSAP (SEQ ID 51 ^ thS f0 " OWin9 DNA extensi °" -^rted in front of 

k 10 the TAA stopcodon: 

(SEQ ID NO: 6): caatcggctcct 

Tail variant Sav-1 ORS HV3 was constructed to have 2 amino acids extra in the C- 
terminus: QP (SEQ ,D NO: 7) with the following DNA sequence extension inserted in front of the 
15 TAA stopcodon: 

' ,! : *• * * (SEQ ID NO: 8): caacca 

u • ..... ; l ; ■ •• ■ - ! - ; 

Tail variant Sav-10RS HV2 was constructed to have one amino acid extra in the C 
terminus: P (SEQ ID NO: 9) with the following DNA sequence extension inserted in front of the 
20 TAA stopcodon: 

(SEQ ID NO: 10): cca 

I The 1 0RS gene and the four tail-variant encoding genes were integrated by homologous 

recomb.nation into the BaciVus suttilis MB1053 host cel. genome. Chloramphenicol resistant 
25 ransformants were checked for protease activity on ,% skim milk LB-PG agar plates 
(supplemented with 6 ug/ m , chloramphenicol). Some protease positive colonies were further 
analyzed by DNA sequencing of the insert to ensure the correct gene DNA sequence, and five 
strams each comprising one of the above constructs, were selected and denoted, respectively 

,n tT , S> B SUbmS SaV - 1 ° RS m °' B SUbm ' S SaV - 10RS B s»™* Sav-10RS 

30 HV2 and B.subtilis Sav-10RS HV3. 

Example 2. Fermentation yields of 10R tall-variants with Savinase signal 

Fermentations for the production of the tail-variant enzymes of the invention were 
performed on a rotary shaking table in 500 ml baffled Erlenmeyer flasks each containing 100 ml 
35 TY supplemented with 6 mg/| chloramphenicol. 
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Six Erlenmeyer flasks for each of the five B. subtilis strains from example 1 were 
fermented in parallel. Two of the six Erlenmeyer flasks were incubated at 37X (250 rpm), two at 
30-C (250 rpm), and the last two at 26»C (250 rpm). A sample was taken from each shake flask at 
day 1, 2 and 3 and analyzed for proteolytic activity. The results are shown in tables 1-3. As it can 
be seen from tables 1 -3, the effect of the tails is a surprisingly high improvement on the 
expression level of the protease, as measured by activity in the culture broth. The effect is most 
pronounced at 26'C and 30«C, but is also evident at 37'C as an effect observed especially at the 
early stage of the fermentation. 
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Table 1: Relative proteolytic activities at 37»c. 





Day 1 


Day 2 


Day 3 


Sav-10RS 


1.0 


1,0 


1.0 


Sav-IORS HVO 


3,3 


0.7 


0.8 


Sav-10RS HV1 


4,7 






Sav-10RS HV2 


2.2 


0.6 


0.4 


Sav-10RS HV3 


5.3 


. 1.4 


V 


Table 2: Relative proteolyti 


c activities 
Day 1 * 


5at30°c." 

Day 2 • 


Day 3 


Sav-10RS 


1,0 


1.0 


1.0 


Sav-10RS HVO 


1.7 


2.2 


2.9 


Sav-10RS HV1 


4,6 


3.1 


4.9 


Sav-10RS HV2 


2.4 


1,9 


2,3 


Sav-10RS HV3 ] 


4,8 


3,0 


4.4 


Table 3: Relative proteolyti 


c activities 
Day 1 


; at 26°C. 
Day 2 I 


Day 3 


Sav-10RS 


1.0 


1.0 


1.0 


Sav-10RS HVO 


1,8 


2.5 


3,1 


Sav-10RS HV1 


2.5 


3.6 


4,3 


Sav-10RS HV2 


1,8 


2.6 


2.8 


Sav-10RS HV3 


2,6 , 


. 3.5 _ 


4,6 



Example 3. Chromosomal integration of tail-variant genes 

The following construct was used for the chromosomal integration of the tail-variant 
encoding genes. The coding sequence of the well-known subtilisin BPN 1 protease was 
operationally linked .to a triple promoter, a marker gene was fused to this (a spectinomycin 
resistance gene surrounded by resolvase res-sites), and pectate lyase encoding genes from 
Baallus subUhs were fused to the construct as flanking segments comprising the 5" polynucleotide 
region upstream [yfmD-ytmC-yfmB-yfmA-Pel-start]. and the 3' polynucleotide region downstream 
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[Pe^nd-yflS-citS(start)] of the tail-variant encoding polynucleotide, respective*. The integration* 
cassette was made by the joining of several different PGR fragments. After the final PCR reaction 
the PCR product was used for transformation of naturally competent B. subtilis cells. One clone 
denoted PL3598-37 was selected and confirmed by sequencing to contain the correct construct. 

The PL3598-37 clone thus contains the following: 

1 . The flanking regions 100% homologous to region of the B.subtilis genome (appears as the 
upstream fragment yfrnD-ytmC-yfrnB-yfrnA-Pelstart and the downstream fragment Pel-end- 
yflS-citS(start)). 

2. The Spectinomycin resistance gene flanked by Resolvase sites (res). 

.3. The triple promoter region plus CrylllA mRNA stabilising leader sequence. 
.4. The BPN' Open Reading Frame. 

Construction of triple promoter RPN 1 cassette 

A PCR fragment comprising the integrational cassette for a BPN' library was 
constructed, thus operably linking a triple promoter (as described in WO 99/43835; Novozymes) to 
a BPN' expression cassette from .a BacWus strain. The triple promoter is a fusion of an optimized 
BaMus amyL-derived promoter (as shown in WO 93/10249; Novozymes) with two promoters 
scBAN and cry.HA, where the first is a consensus version of the BacWus amyldquefaaens 
amylase BAN promoter, and the latter includes a mRNA-stabi.ising sequence (as described in WO 
99/43835; Novozymes). Suitable primers can be derived from the publicly available sequences 
(Vasantha, N. et al. Genes for alkaline protease and neutral protease from Bacillus 
amylolwefaciens contain a large open reading frame between the regions coding for signal 
sequence and mature protein. J. Bacterid. 159:811 (1984) EMBL accession No. K02496) A Kpnl 
and a Sail restriction site was introduced to flank the PCR fragment at each end. using the 
primers: 

#252639 (SEQ ID NO: 11): catgtgcatgtgggtaccgcaacgttcgcagatgctgctgaagag 
#251992 (SEQ «D NO: 12): catgtgcatgtggtcgaccgattatggagcggattgaacatgcg 

The Kpnl and Sail restriction sites in the PCR fragment were subsequently used to clone 
the fragment into a Kpnl-Sal. digested Peel-Spec PCR fragment The Peel-Spec fragment 
compnses a Spectinomycin resistance gene inserted in the middle of the B.subtilis Pectate lyase 
gene plus approx. 2.3 kb of upstream genomic DNA and approx. 1.7 kb downstream genomic 
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DNA. The Peel-Spec fragment was produced bv Pm am «nfi~,.- 

B,utms stra* M B,053. ueing the prim ^ °' ^ ,he 

™ IS S 13>: 9C9tt9a9a ^^cc9C 9ag c 9 cc 9 « 9g c g aa, 9 a te c 
#179542 (SEQ ID NO: 14): gcgttgagacagacgagcagggaaaaatggaaccgcmttc 

Construct ion of MB 1053 

• Crouch ,„T ** ^ by deleti °" - ,he CPeO gene 

. trough mtegrahon of a PCR product into a wild-type B.sMlis frpes**, ge „or„e This was 

' - ~ <™ *~» and upsfream tfl 

Pectate lyase aene of the* R *hh*u:« v 1 ine 



etono* JT T 9e "° miC DNA PreCed,n9 and "*»••*<» *• Pel gene were 

d i ed h r;° U9h / nmer ^ * • 100% homologous ,o DNA Lguanl 

. — ■ «- 1-** — Ihe marker gana <Spao, conferred resistance tospLnomyT 
. , and . was seated hefween Iwo Res sltas. a»oge,her prasan, on the plasmid pwitZS 
■ In US patent No. 5,882,888). Three different PCR fragments were initially produced. •> 

Fragment 1: this fragment covers from the yfmD gana to the middle of the Pet gana and 

T ° Uert,an9 '° — - *» Pet gene. The s«e of Z 
2.8 kb. The fragment was produced bv a pgr amn ix^- u 9 1 ,s 

at* PL2308, using th. pLTrs «•"»»»- DNA ** m * 
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#179541 (SEQ ID NO: 13), and 

#179539 with overlap to #179154 Spec primer (SEQ ID NO: 15): 

ccatttgatcagaattcactggccgtcgttttacaaccattgcggaaaatagtcataggcatcc 

a F nTrr : fra9ment ^ ^ ^ midd ' e ° f thB Pe ' 9ene to after ^ — of the CKS gene 
and ^troduong an overhang to the Res^pec-Res cassette at the midd,e of the Pe. gene The 

d"a fLm Z 1 * ^ ^ ^ Pr ° dUCed by 3 PCR a ™°" * chlosoma 
DNA from the B.subtilis strain PL2306, using the primers: 

#179542 (SEQ ID NO: 14), and 

#179540 with overlap to #179153 Spec primer (SEQ ID NO: 16): 

ggatccagatctggtacccgggtctagagtcgacgcggcggttcgcgtccggacagcaca 
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Fragment 3: this fragment contains the Spectinomycin gene surrounded by Res sites and DNA 
sequences in the ends overlapping with PGR fragment 1 and 2. The size of fragment 3 is 1 6 kb 
Fragment 3 was produced by PGR amplification of plasmid DSJ3358. using the primers: 

#179154 (SEQ ID NO: 17): gttgtaaaacgacggccagtgaattctgatcaaatgg 
#179153 (SEQ ID NO: 18): ccgcgtcgacactagacacgggtacctgatctagatc 

Standard conditions fn r the PCR roar*™ 

For the PCR amplifications of fragment 1-3 the HiFi Expand™ PCR system (Roche) was 
used together with the following cycling scheme: 
5 pi Buffer 2 

14 pi dNTP's (1.25 mM each) 
2.5 ud 20 pM primer 1 
15 2.5 pi 20pM primer 2 ... 

: . - ' x pi water :,:••■. 

• To this mix 3 pi of DNA (apx. 100 ng) and 0.75 pi Enzyme mix (use hot start) is.added " \ . 
Total volume is 50 pi. 
The cycling profile is: 
20 1 cycle of 120 sec at 94*C 
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Break. 



10 cycles of 15 sec at 94'C, 60 sec at SOX, 240 sec at 72C. 

20 cycles of 1 5 sec at 94'C. 60 sec at 60'C, (180 sec at 72*C add 20 sec pr cycle) 

1 cycle 600 sec at 68*C. 



The three PCR fragments were made and joined in later JOINING-PCR reactions. The three PCR 
fragments were single sharp bands and no gel purification was necessary. Only Qiagen™ PGR 
purification was performed prior to the following JOINING-PCR. 
JOINING of fragment 1 + 3 (same procedure for fragment 2 + 3): 
30 5 pi Buffer 2 

8 pi dNTP's (1 .25 mM each) 
5.0 pi Fragment 3 
5.0 pi Fragment 1 
' 9.25 pi water 
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1 cycle of 120 sec at 94*C. 
Break. Add Enzyme 

10 cycles of 15 sec at 94'C, 60 sec at 60*C. 240 sec at 72'C. 
Break. Add Primers 

5 1 5 cycles of 1 5 sec at 94'C, 60 sec at 60'C,(1 80 sec at 72'C add 20 sec pr cycle) 

1 cycle 600 sec at 68*C. 

After the first cycle at 94'C for 120 sec there is a break, where 0.75 pi Enzyme mix is added. 
; Total volume is now 45.0 pi. 
10 After the initial 10 cycles, there is another break in the cycling and for fragment 1+3: 2.5 ul (20pM 
#179541) and 2.5 ul (20 pM #179153) are added and for fragment 2+3: 2.5 ul <20uM #179542) 
and 2.5 ul (20 pM #179154) are added and the cycling is continued for 15 cycles more. 

The PCR products were then gel purified: The size of fragment 1+3 should be 3.4 kb 
15 and the size of fragment 2+3 should be 3.4 kb. These two fragments were joined in a- last PCR 
- reaction (Expand™ long system, Roche): ..... 

5 pi Buffer 1 

14 pi dNTP's (1.25 mM each) 
20 5.0 pi Fragment 1+3 

5.0 pi Fragment 2+3 
17.75 pi water 

After the first cycle at 94»C for 120 sec there is a break, where 0.75 pi Enzyme mix is added. 
25 Total volume is now 45.0 pi. 

After the initial 10 cycles, there is another break in the cycling and 2.5 pi (20pM #179541) and 2.5 
pl (20 pM #1 79542) is added and the cycling is continued for 1 5 cycles more. 

1 cycle of 120 sec at 94'C. 
Break. Add Enzyme 

30 10 cycles of 15 sec at 94'C, 60 sec at 60'C, 240 sec at 68'C. 

Break. Add Primers 

1 5 cycles of 1 5 sec at 94*C, 60 sec at 60'C, 1 80 sec at 68'C add 20 sec pr cycle 
1 cycle 600 sec at 68'C. 
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The size of the joined PCR fragment is 6.8 kb. This PCR fragment was purified using a 
Qiagen™ PCR purification kit, and 5 M l of the 50 gl eluted DNA was used to transform a standard 
B.subtilis strain. After transformation cells were spread onto LBPG-120ug/ml of spectinomycin. 
Next day more than 1000 colonies were seen. 8 of these were checked using PCR primers from 
last JOINING PCR amplification yielding PCR fragment of 6.8 kb rather than the 5.2 kb expected if 
deletion had not occurred. Furthermore, the pectatelyase activity of the clones was checked with 
the Mancini Immunoassay, which showed no reactivity towards the pectatelyase activity. This 
taken together with the Spec resistance tells us that deletion had occurred. One such clone was 
; selected and denoted MB1053. 

Insertion of BPN' expression cassette ad v ent to the res-spec-res in MB10S3 

The ligation mix of the digested PCR amplified triple promoter BPN' expression cassette 
and the Kpnl-Sal digested Ped-Spec PCR fragment was used as template in a PCR amplification 
using the PCR primers #179541 and #179542. This resulted in a PCR fragment of approx. 9 kb, 
which was used to transform B.subtilis PL1801 (Diderichsen. B et al. 1990. Cloning of aldB. which 
encodes alpha-acetolactate decarboxylase, an exoenzyme from Bacillus brevis. J. Bacteriol., 172. • 
4315-4321) competent cells. The transformed cells were plated on LB-120 ug/ml Spectinomycin -. 
agar plates with skim milk. Spectinomycin resistant colonies with large skim milk clearing zones 
were restreaked on Spectinomycin agar plates and analysed for the integration of the PCR 
fragment with PCR using the primers #179541 (SEQ ID NO: 13) and #179542 (SEQ ID NO: 14). 

Appearance of a 9 kb fragment indicates that the PCR fragment has been integrated 
into the host cell genome. Several of these clones were sequenced to confirm integration of the 
expression cassette, one such clone was selected and denoted PL3598-37. 

Example 4. Construction of plasmid-borne chromosomal integrational cassette 

An E.coli plasmid-bome integrational cassette for a library may be constructed In vivo. 
An integration cassette to be used according to the method of the invention may be present on a 
E.coli plasmid (which is capable only of replication in E.coli. not in B.subtilis), the plasmid 
comprising: 

i) The DNA sequence encoding the Pre-Pro-domains of the subtilisin protease 
commonly known as Savinase, preceded by and operably linked to 

ii) a DNA sequence comprising a mRNA stabilising segment derived in this particular 
case from the Cryllla gene; 

iii) a marker gene (a chloramphenicol resistance gene), and 
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iv) genomic DMA from BacWus subtilis as and v 
5- po ly nudeotide region upstream of tJ-T f . ! 9 Se9TOiMS: The """""So"" 

- r po*nuJL ^1^7^ ,^^^^ 
respective*, """"stream of the polynucleotide [Pe^nd-ySS^aart)]. 

5 and PCR^nnTT 6 ? ^ ^ SeVeral Cl0n ' n9 **» inUOlVin9 * PUC19 ptasmid 

zz r~;rzr mi rc; n r-r s,tes of — dw ^ ~ 

P,sm, tna ,iga,on mi„ U re J^LZZ^Z Si* ^ 

The PMB1508 plasmid thus contains the following- 

■"(rbs), op!^::::rr s,ab, " n9 ^ — - * *— — 

Hi) The chloramphenicol resistance operon- ' ' ' * ' 

37254- v/" 6 TT ^ " 5,ed Wa ' e d0ned > UC1S » ««*»• ('"Wed from ««* ATCC 

urns, ouuuiis ibe BGSC accession number 1A1 ifiR tmr? tw« ♦ 

i) Tne triple promoter and the mRNA stabilising element 

emf-yn^s^Jdot^r: of the .npie-promote, and t. polynucfeotide region [Pe,- 
v n,j aownstream of the mRNA stabilizing element. 
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Thus, when using MB1 510 competent cells, it is possible for the P MB1508 (or 
derivatives thereof) to directly integrate into the genome of MB1510 where the two flanking 
reg.ons in fusion with the triple-promoter and mRNA stabilising element isJocated. resulting in a 
construction where the incoming PrePro encoding DNA of P MB1508DNA has been integrated in 
the correct reading frame with the tripel-promoter. the mRNA stabilising element and the RBS 
Thus resulting in high expression of the integrated gene from the promoter elements already 
present on the genome of MB1 510. 

Transformation effeciency was established for the B.subtilis strain MB1510 transformed 
; with E.coli prepared plasmid pMB1508. For further testing of the potential of using this approach 
the Savinase encoding gene of Bacillus clausii was PGR amplified using the two PGR primers: 

Primer#317 (SEQ ID NO: 19) tggcgcaatcggtaccatgggg 

Primer #139 Notl (SEQ ID NO: 20) catgtgcatgcggccgcattaacgcgttgccgcttctgcg 

• The resulting -0.8 kb of the Savinasefragment and the pMB1508 plasmid are digested 
wrth Kpnl and Notl, and the resu.ting fragments are then.purifiied by agarose gel electrophoresis 
The two fragments are ligated, and the ligation mixture is used to transform competent E coff cells 
wh.ch are then plated on LB-agar plates or placed in liquid media for growth overnight at 37'C- 
both types of media containing 50-100ug/m. of Ampicillin. After incubation, a plasmid prep is made 
of the ..quid culture. The purified plasmid is used for transformation of competent cells of MB1510 
(using 100-10.000 ng of plasmid per transformation. The transformed cells are plated onto TY 
medium with 2% skimmi.k and 6 ug/ml of ch.oramphenicol for selection. After overnight incubation 
at 37 C cleanng zones appear around those colonies wherein the integration cassette is 
integrated properly into the cells, indicating high Savinase expression. 

This approach can also be used to make highly diverse libraries of any gene of interest 
expressable in B.subtilis, where rather than a gene encoding one enzyme, any expressable 
polynucleotide is inserted into the plasmid P M81508 and integrated into the MB1510 strain for 
subsequent screening. 

Sequence of plasmid p MB150ft ( R PQ |p nq : 21) 

The plasmid P MB1 508 has the following components, indicated by basepair positions: 

BP 51 86-395: P UC1 9 sequence from E.coli clone ATCC 37254. Vieira J, Messing J The 
PUG p«asmids. an M13m P 7-derived system for insertion mutagenesis and sequencing with 
synthetic universal primers. Gene 19: 259-268. 1982. 
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ele^nt iT 3 T™' EC ° R ' ° ,0nin9 SitB ( BP396 ^ 01 ) «"d the Crv.HA mRNA stabilising 
element. (Descnbed in WO 9634963-A1) «>"»««ng 

(Pre-Pro Z ** ° f a " d the **0 

ST hT« " *° 9623073 - A1 ' ,he NM Site and the between the Pre- 

Pro and A/o/l was introduced by the PCR primer. 

transfer^ ^"T^ ^ " C, ° n,n9 "* (BP1413 - 1418 > ™» the Chloramphenicol acetyl 
transferase operon of pDN1050 (Described in eg. Diderichsen.B, Pou.sen.G.B, Joergensen S T ■ 
A useful cloning vectorfor Bacillus subtilis. Plasmid 30:312 (1993)). 

' nHR i BP f 2 l 13 "! 185: PO,ynudeotide re 9 jo " IPel-end-yflS-citStstart)] downstream of the 
dal s f°p? BSUmS ^ (3S *~» <™ *• P«on and correspond g 

CZoTete N ' ° 9aSaWara ' '■ MOSZer ' <146 ° ther aUth ° rS> ' ^ Y <~ * Oanchin 

The complete genome sequence of the Gram-positive bacterium Bacillus subtilis" 

Nature (1997) 390:249-256). 

15 ' The Bacillu s subtilis strgjn MBIfim , t 
: MB1510hasthefollowingspecificfeaturesinandaroundthepelBlocus- • 
' seluen^! 6 ^ ^ ^ Eluding a RBS (Ribosome .binding 

IbTapTTT C ° mPriSin9 ^ f0,l0VW ' n9 h ° m0,O9OUS P^-leotide region fyfmD-ytmC- 

citSfsial ° f ^ Wp,e -~' and *. Polynucleotide region pLnd^S- 

ctS(start)] downstream of the mRNA stabilizing sequence. 

Sequence of MR1 51 n genomjcj jpiggiitioQ ^.n , fSEQ (n Mr> . „ } 

yfmA-Pel L 1 !? 1 C ° rreSPOndS t0 SeqUenCe ° f Badl,US SUbti ' iS 9en ° me ^ytmCyfmB- 
yfmA-Pel-s art (as * appeaars from the publication and corresponding database of: F. Kunst e t a. 

The complete genome sequence of the Gram-positive bacterium Bacillus subtilis" 
Nature (1997) 390:249-256). 

fDe^ri^!^ 3102 " 4 ^ 2 ' ^ tHPte Pr ° m0ter Cryl,,A mRNA Stabi,iSin 9 e,eme "t P'us RBS 
(Descnbed above in PL3598-37 construct). 

BP 4083-5718: The polynucleotide region [Pel-end-yflS-citS(start)] end of and 
downstream of the pe.B locus of the B.suMis genome (as H appeaars from the publication and 
correspond.ng database of: F. Kunst, N. Ogasawara. I. Moszer, <146 other authors^ 
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Example 5. Construction of a 2 amino-acid tail-variant library 

This example shows the construction of a tail-variant library. In this library two amino 

TJllTTt T he C ~ terminal of * e 10R protein - Such a ™^ *« b * w«h 

s ^tr 8 ?nr usin9 016 fo,,owin9 pcr pnmere in a pcr reacti ° n 

s DNAfromS.suW/teiORS as template: 

1605 (SEQ ID NO: 23): gacggccagtgaattcgataaaagtgc 

1606 (SEQ ID NO:24): ccagatctctatnktnktgtacggagtctaactccccaagag 

10 wherein N = A, C. G or T; and K = TorG. 
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and « ,. I"" ?' U,tln9 Pr ° dUCt ^ di96Sted With EC ° R ' and B * » and into EcoR I 

and Bgl II d,gested P MB1508. Hereafter following the principle described above. 

• Chloramphenicol .resistant Sac/flas rift transformants were picked by a robotic 
colony pick er from a bioassay plate . and transferred into a 384 we.l microtiter. plate (MTP) 
• conta^g 0.05 X TY supplemented with 6 mg/l chloramphenicol <60p,/wel<). The MTPs- were 
mcubated at 26X for 72h. After incubation each we,, was anar^ed for proteose activity. 

The thirty Bacillus subti/is transformants with highest proteolytic activity were se,ected 
for determ-nation of the two tai, amino acids in each transformed by DNA sequencing the 
sequencing results are summaries in table 4 and table 5. 



AA Tail 


No. of transformants 


TL 


4 


TT 


4 


QL 


3 


TP 


3 


LP 


r 3 


II 


2 


IQ 


2 


QP 


2 


PI. 


2 


. LT 


1 1 
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TQ 


1 


IT 


1 


QQ 


1 


PQ 


1 


Total 


30 



Table 4: column one shows the amino acid sequence of the tail, and column two shows the 
number of Bacillus subtilis transformants sequenced with that particular AA tail sequence. 



Possibilities position 1 


Result 


Possibilities position 2 


Result 


K 


0 


K 


0 


R 


0 


R 


0 


T 


14 


T 


6 


• ' I 


- ' 3' • 


1 


4 


Q' 


6. 


Q ' 


5 


P 


3 


P 


8 


L 


4 


L 


7 


Total 


30 


Total 


30 



Table 5: The table shows the amino acid- which could be introduced by the primer usedfor the 
l.brary construct and the actual findings by DNA sequencing of the thirty colonies isdated from 
screening. 
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Example 6. Construction of Bacillus subtilis strains L2, L2 HVO, and L2 HV1 

A Bacillus subtilis strain was made analogously with the construction of the Bacillus 
subtos strain 10RS, with the DNA coding for the preform of the S2A protease from Nocantiopsis 
dassonv,l,ei subsp. Dassonville* DSM 43235, denoted 12, fused by PGR in frame to the DNA 
eod.no for the signal peptide from SAV.NASE™ (a well-known commercial protease derived from 
Baalusclau,,. available from Novozymes, Denmark), the resulting strain was denoted Bacillus 
subtilis Sav-L2. 

The DNA seance including the coding region for the pro-mature S2A protease from 
Nocariopas dassonvillei subsp. Dassonv///ei DSM 43235, as amplified with primers 1423 and 
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1475 Is shown in SEQ ID NO: 25. The conesponxBnn encoded p,o-,orm a m ,no odd ee q uence for 
the 12 protease is shown in SEQ ID NO: 28. 

1423 (SEQ ID NO: 26): QctWagttoatcgatcgcatcggctgctccggcccocgtccccoag 
1475 (SEQ ID NO: 27): ggagcggattgaacatgcgattaggtccggatcctgacaccccag 

Two tail-variants of this construct were also made. Tail variant Sav-L2 HVO was 
constructed to have 8 amino acids extra in the C-terminus: QSHVQSAP (SEQ ID NO: 3). by using 
Z Tr S f qUenCe eXtenSi ° n fr ° nt ° f the ™ -topcodon which is shown in SEQ ID 

QSAP (slTNO S T b HV1 10 ^ 4 "*» "** ^ * - ^-terminus: 

QSAP (SEQ .D NO. 5), by us,ng the DNA sequence" extension inserted in front of the TAA 
stopcooon which is shown in SEQ ,D NO: 6. Both tail variants had the SAVINASE™ signal- 
pept.de encoding sequence fused in frame with the pro-mature encoding sequence, just like in 



The ^ 9*™ ^d the two tail-variants Sav-L2 HVO and Sav-L2 HV1 were integrated 
by homologous recombination on the Baa„us su b Ws MB1053 host cel. genome- as outlined 

IZ^TTTT ; eSiStant tranSf0rmantS «~ Checked:for P^ase activity on 1.%- skim 
m Ik LB-PG agar plates (supplemented- with- 6 ug/m. chloramphenicol). Some protease positive 
co.on.es were further analyzed by DNA sequencing of the insert to confirm the correct DNA 
sequence and one strain for each construct was selected and denoted B.su b W s Sav-L2 
B.subUUs Sav-L2 HVO, and B.subtilis Sav-L2 HV1 . respectively. 

Example 7. Fermentation yields of the Bacillus strains of example 6 

500 ml I^ffl^!' Str3inS ° f SXamP,e 6 ' ferme " ted - a -t a ry shaking table in 

chl h . " meyer ^ C ° ntainin9 100 m ' " -PP'^ented with 6 mg,, 
7 h S ' X Er ' enmeyer fl3Sks for -* « «ie three B. suMIs strains were fermented in 

paranel. Two of the sbc Erienmeyer flasks were incubated at 37'C (250 rpm), two at 30"C (250 
rpm) and the last two at 26X (250 rpm). A sample was taken from each shake flask at day 1 2 
and 3 and analyzed for proteolytic activity. The results are shown in tables 6-8. As it can be seen 
from tables ,6-8. the effect of the tails also increases the expression level for the Sav-L2 protease 
from Noavtopm dasson^e/ subsp. Dessonvi,,* DSM 43235 when expressed in B. subtWs An 
increase of up to 4» is obsenred in this experiment, but overa.l improvement is observed for both 
tail-vanants at all three temperatures tested. 

Table 6. Relative proteolytic activities at 37*c. 
L-D ay_1 | Dav2 | r> ay o | 
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Sav-L2 


1.0 


1.0 


1.0 


Sav-L2 HV1 


1.4 




1.2 


Sav-L2 HVO 


1.3 


1.1 


1.4 


Table 7. Relative prot 


Bolvtic act 
Day 1 


vitioe ot qi 
VlUcS 31 o 

Day 2 


J c. 
Day 3 


Sav-L2 


1.0 


1.0 


1.0 


Sav-L2 HV1 


1.0 






Sav-L2 HVO 


1.1 


1.3 


1.3 


Table 8. Relative profc 


Jolytic acti 
Day 1 


vities at 2( 
Day 2 


Day 3 


Sav-L2 
Sav-L2 HV1 
Sav-L2 HVO 


1.0 
1.3 

_ 0.2 


1,0 
1,1 
- 1-1 


1,0 
1.1 
1.1 
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Example 8. 1 0R tail-variants with heterologous pro-regions in Bacillus 

m The DMA sequence coding for the pro-region from the L2 protease from Nocardiopsis 
dassonvillei srtnp. Dassonvillel, DSM 43235 is shown in SEQ ID NO: 29, and the corresponding 
ammo ac,d sequence is shown in SEQ ID NO: 30. fK Bacillus subtilis strain denoted L210R; similar 
to the Baallus subtilis strain 10RS, but with the DNA coding for the pro-region of the L2 replacing 
the pro-region of 10RS, was made . The entjre ^ QR ^ 

region of L2, is shown in SEQ ID NO: 31 . 

Two tail variants of the above construct were also made. Tail variant HVO was 
constructed to have 8 amino acids extra in the C-terminus: QSHVQSAP (SEQ ID NO: 3) with the 
DNA shown in SEQ ,D NO: 4 inserted in front of the TAA stopcodon of the encoding sequence. 
Tail vanant HV1 was constructed to have 4 amino acids extra in the C-terminus: QSAP (SEQ ID 
NO 5 ) w,th the DNA sequence shown in SEQ ID NO: 6 inserted in front of the TAA stopcodon of 
the encoding sequence. 

The 10RL2) construct and the two tail variants were integrated by homologous 
recombination on the Bacillus subtilis MB1053 host cel. genome. Chloramphenicol resistant 
ransformants were checked for protease activity on 1% skim milk LB-PG agar plates 
(supplemented with 6 ug/m. chloramphenicol). Some protease positive colonies were further 
analyzed by DNA sequencing of the insert to confirm the correct DNA sequence, and a strain for 

^oTT* Se,eCted ' ^ d6n ° ted B sUbWis L210R - *«Wfc L210R HVO. and B.subtilis 
L210R HV1, respectively. 



Example 9. Fermentation yields of 10R tail-variants with heterologous pro-region 
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The six a subtiiis strains 10RS, 10RS HVO, 10RS HV1, L210R. L210R HVO, and L210R 
HV1, were fermented on a rotary shaking table in 500 ml baffled Erienmeyer flasks containing 100 
ml TY supplemented with 6 mg/l chloramphenicol. Six Erienmeyer flasks for each of the a subtiiis 
strains were fermented in parallel. Two of the six Erienmeyer flasks were incubated at 37'C (250 
rpm), two at 30?C (250 rpm). and the last two at 26'C (250 rpm). A sample was taken from each 
shake flask at day 1, 2 and 3 and analyzed for proteolytic activity. The results are shown in figure 
1. and in tables 9-11. As .it can be seen from the results, the effect of the exchange of the 
proregion from. 10R with the proregion from the L2 protease resulted in a surprisingly high 
; improvement on the expression level of the 10R protease as measured by proteolytic activity in 
the culture broth at 37°C. The effect is most pronounced in the two tail variants. 

Table 9. Relative proteolytic activities at 37°c. 





Day 1 


Day 2 


Day 3 


10RS 


1.0 


1.0 


1.0 


10RS HVO 


3,7 


8.9 


3.5 


10RS HV1 


3.9 


8.5 


4.3 


. L210R 


1.9 


2.3 


1.6 . 


L210R HVO 


5.3 


14.4 


7,3 . 


L210R HV1 




20.9 


; 7,6 


Table 10. Relative protec 


>fytic activ 
Day 1 


ities at 30* 
Day 2 


C. 

Day 3 


10RS 


1.0 


1.0 


1.0 


10RS HVO 


2.8 


3.1 


4.3 


10RS HV1 


3,6 


3.6 


4.9 


L210R 


0.6 


0.4 


0.9 


L210R HVO 


3.5 


3.2 


4 f 5 


L210R HV1 


3.7 


3.2 


, 4,5 


Table 11. Relative proteolytic activities at 26° 


C. 




Day 1 


Day 2 




10RS 


1.0 


1.0 


1.0 


10RS HVO 


2.6 


.- 3,0 . . 


2.8 


10RS HV1 


3.7 


3.3 


3 r 1 


L210R 


0.4 


.... 0.7 


0.4 


L210R HVO 


2.3 


2.1 


1.9 


L210R HV1 


2.2 


1.7 


1.7 



Example 10. Repeat of examples 1-9 with other 10R-like proteases 

Completely analogously with the above examples 1 through 9. similar experiments are 
carried out with the proteases of the following Nocardiopsis strains: 
(a) Nocardiopsis dassonvillei NRRL 18133 as described in WO 88/03947; 
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(b) NocantopsJs sp. NRRL 18262 as described in WO 88/03947. the DNA and amino acid 
sequences of the protease derived from Nocardiopsis sp. NRRL 18262 are shown in DK 
patent application no. 1996 00013, and WO 01/58276 describes the use in animal feed of 
acd-stable proteases re.ated to the Protease derived from A/oca^ops/ssp. NRRL no. 18262- 

(c) Nocartopsis Alba DSM 15647; the amino acid sequence of the protease is SEQ ID NO- 33 
the encoding nucleotide sequence is SEQ ,D NO: 32; the gene Is isolated from the genomic 
DNA of th,s strain by PCR-amplification using the two primers: 

1421 (SEQ ID NO: 34): gttcatcgatcgcatcggctgcgaccggccccctcccccagtc 
; 1604 (SEQ ID NO: 35): gcggatcctatcaggtgcgcagggtcagacc 

(d) Nocardiopsis prasina DSM 15648; the amino acid sequence of the protease is SEQ ID NO- 
37, the encoding nucleotide sequence is SEQ ID NO: 36; the gene is isolated from the 
genom,c DNA of this strain by PCR-amplification using the two primers- 

1346 (SEQ ID NO: 38): gttcatcgatcgcatcggctgccaccggaccgctcccccagtc 

1602 (SEQ ID NO: 39): gcggatcctattaggtccggagacggacgccccaggag 

(e) Nocardiopsis prasina DSM 15649; the amino acid sequence of the protease is SEQ ,D NO- • - 

" 1 **• nT? nUC ' e0tide ^^^^ iS SEQ ,D NO: 4 * *» gene is isolated from the . ' 
genom,c DNA of this strain by PCR-amplification using the two primers- . * 

1603 (SEQ ID NO: 42): Qltcatcgatcgcatcggctgccaccggaccactcccccagtc. and 1602 (SEQ ID 
NO, 39), 

Example 11. *, vM> monogastric performance of a 10R.|ike protease from DSM 4323S 

The performance of the Atocardtopafe dessonvzfef subspecies dassonvUm DSM 43235 
protease assayed in a monogasbic In v*o digestion mode). The performance of a punned . 
preparabon of the mature part of me protease having SEQ ,D NO: 26 (prepared as delibed 
above was tested in an ,n Wto mode! simulating me digestion in monogasbic animais ,„ 
particular, the protease was tested for its ability to improve solubilisalion and digestion of maiaeA 

rteTnvenlon^ ^ """^ ^ fMKU * " ° f 

• k , IT T C ° nSiS,ed °' " ' la * S " hich ™^-S8M substrate was initially " 

incubated with HCpepsin - simulating gasblc digestion - and subsequent* „,m pancrealin - 
s,mutat,ng mtesbna, digestion. 10 of the flasks wera dosed with the pratease at me start of .he 

tababon phase samples of ,„ ^ dfgesra were removed and analysed for soiuUiised and ' 
digested protein. 
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Table 12: Outline of in vitro dig esti on procedure 



Components added 



10 g maizeZ-SBM substrate 
(6:4) t 41 m!HCI(0.105M) 



5 ml HCI (0.105M)/ pepsin 
(3000 U/g substrate), 1 mL 
protease of the invention 



16 ml H 2 0 



7 ml NaOH (0.39M) 



5 ml NaHCO* (1M) / 
pancreatin (8mg/gdiet) 



Terminate incubation 



PH 



3.0 



3.0 



3.0 



6.8 



6.8 



7.0 



Temperature 



40°C 



40°C 



40°C 



40°C 



40°C 



40°C 



Time 
course 



t=Q min 



t=30 min 



t= 1.0 hour 



t=1 .5 hours 



t=2.0 hours 



t=6.0 hours 



Simulated digestion 
phase 



Mixing 



Gastric digestion 



Gastric digestion 



Intestinal digestion 



Intestinal digestion 



Conditions 
Substrate: 
PH: 
HCI: 
pepsin: 
pancreatin: 
temperature: 40°C 
Replicates: 5 



'4 g SBM, 6 g maize (premixed) 
3.0 stomach step/ 6.8-7.0 intestinal step 
0.105 M for 1,5 hours (i.e. 30 min HCI-substrate premixing) 
3000 U/g diet fori hour 
8 mg/g diet for 4 hours 



Solutions 
0.39 M NaOH 
0.105 M HCI 

0.1 05 M HCI containing 6000 U pepsin per 5 ml 
1 M NaHC0 3 containing 16 mg pancreatin per ml 
125 mM NaAc-buffer, pH 6.0 

Enzvme p rotein determinating 

The amount of protease enzyme protein (in what follows, Enzyme Protein is abbreviated 
EP) is calculated on the basis of the values and the amino acid sequences (amino add 
compositions) using the principles outlined in S.C.Gill & P.H. von Hippel, Analytical Biochemistry 
182,319-326,(1989). . ' 
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Experimental procedu re for in vitm mnHai 

The experimental procedure was according to the above outline. P H was measured at time 
1. 2.5. and 5.5 hours. Incubations were terminated after 6 hours and samples of 30 ml were 
removed and placed on ice before centrifugation (10000 x g, 10 min. 4'C). Supernatants were 
removed and stored at -20°C. 



Analysis 



«n ' < 1 ^ SamPleSWWeana, ^ edf0r%de9rTOOf P r0tein ^ 
10 of solubihsed and digested protein using gel filtration. 

DH determination bv the OPA.moth»>4 

The Degree of Hydrolysis (DH) of protein in different samples was determined using an 

15 oIZTT, miCr ° Wer P,3te ^ C ° l0rimetriC ^ Petersen.D, 
oZ T ' meth ° d f ° r determinin9 f °° d Pr0tein de 9 ree of ^'ysis- J-Food Sci 
2001 66. 642-646). The OPA reagent was prepared as fo.lows: 7.620 g d,Na tetraborate 
decahydrate and 200 mg sodiumdodecy, su.phate (SDS) were dissolved in 150 ml deionsed 

sTcnvl^T Wefe COmP,ete ' y diSS ° ,Ved b6f0re C ° ntinUing - 160 "« o-phthal-dialdehyde 
97 /o (OPA) was d,sso.ved in 4 ml ethanol. The OPA solution was transferred quantitatively to the 
20 above-mentioned solution by rinsing with deionized water. 176 mg dithiothreito. 99% (DTT) was 
added to the solution that was made up to 200 ml with deionized water. A serine standard (0 9516 
meqv/l) was prepared by solubilising 50 mg serine (Merck. Germany) in 500 ml deionized water 

The s ^P'e solution was prepared by diluting each sample to an absorbance (280 nm) of 

25 T h '„ SUPematantS W6re dHuted < 100 *> an automated Tecan dilution station 

25 (Mannedorf, Swtzerland). All other spectrophotometer readings were performed at 340 nm using 
de,on,zed water as the control. 25 ul of sample, standard and blind was dispensed into a microtiter 
p ata The micro-titer plate was inserted into an iEMS MF reader (Labsystems. Finland) and 200ul 
of OPA reagent was automatical dispensed. Plates were shaken (2 min; 700 rpm) before 
30 ™n7rt2 abSOrbanC6 ' Fina " y ' ^ DH ^ Ca ' CUlated - E ^ htf °W termination of all samples was 

Estimation of solubilised and dig^foH r ^\n 

The content of solubilised protein in supernatants from in vitro digested samples was 

35 TTZ ^ TTT" CrUd6 Pr ° tein (CP) USln9 961 ** te " HPLC - ^Pematants were thawed, 
filtered through 0.45 urn polycarbonate filters and diluted (1:50..v/v) with H 2 0. Diluted samples 



11 0423.020.DK 



10 



15 



20. 



25 



(R 0.9993) was made from a dHuBon senes of an *, w *o digested rafaranca maiarf-SBMsamote 
wdh known tota. prota* content Tha protain determination in ,h,s reference sampie « ZE 

.c:x:r ,, " , " , ^ MA6wal - r ' 

Tha content of digastad protain was estimated by integrating tha chromatogram area 
onaspond,ng ,o paptidas and amino adds having a moiaoular mass o, 1S00 Oa^TbeZ 

Food So. 1986 SI. 494-198; Babmszky.L; Van.D.M.J.M, Boar.H, Den.H.LA. An ,n-»itro Mahod 

TtZZT Di3es,ible cmde Protein c —- h pia FMds - j - s * «*■ CTsTs? 

173-178: Bosen.s, E gg Um . B .o. C«oa, Evaiuation o, ,n*o Memoes for Estimating 

•n S,mp,e-Stomach Animals. Nation Rasaaroh Reviews 1991 4 141-162) TcZfZTT 

£Z2 9aswn '• and subaance p usa >' - »Z 

Results 

Tha results shown in Tables 13 and 14 below indioate that the protease increased tha 
Degree of Hydrolysis <DH), as we,, as so,ubte and digesdble pmtein signify. 




^.ne,em ,st,ers within 55 'same Jotumn ind ite sgnBoint d,tterenees 1-way AnL a Tukev 
Kroner tes, PO.OS,. SD . Standand 0e.atio, % CV . Coef^en, of VadL IgZ^Z 
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Table 14: Stabilised and digested r.mri^ protein measured bv AKTA hpi r. 



Enzyme 
(dosage in 
mg EP/kg 
feed) 



Blank 



Protease of 
the invention 
(50) 



(100) 



Of total protein 



%dig. 
CP 



54.1 



57.7 



58.9 



SD 



1.1 



1.1 



0.8 



%sol.CP 



90.1 



93.2 



94.8 



SD 



1.1 



1.4 



0.9 



Relative to blank 



%dig.CP 



100.0 



106.7 



108.9 



CV% 



2.0 



1.9 



1.3 



%sol.CP 



100.0 



103.4 



105.2 



CV% 



1.2 



1.5 



0.9 



Different letters within the same column indicate significant differences (1-way ANOVA, Tukey- 
Kramertest, P<0.05). SD = Standard Deviation. %CV = Coefficient of Variance = (SD/mean value) 
5 X100% 
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Example 12. In vitro aquaculture performance of 10R-Iike protease from DSM 43235 > 

■•' •• Performance of :the protease from Nocardiopsis dassonvitlei subsp. dassonvillei -DSM 
43235 in an aquaculture in vitro model. The protease preparation as described in Example 3 was 
tested in an aquaculture in vitro model simulating the digestion in coldwater fish. The in vitro 
system consisted of 15 flasks in which SBM substrate was initially incubated with HCI/pepsin - 
simulating gastric digestion - and subsequently with pancreatin - simulating intestinal digestion. 
10 of the flasks were dosed with the protease at the start of the gastric phase whereas the 
remaining 5 flasks served as blanks. At the end of the intestinal incubation phase samples of in 
vitro digesta were removed and analysed for solubilised and digested protein. 

Table 15: Outline of aqua in vitro digestion pronprinrp. 



Components added 



10 g extruded SBM substrate, 
62 mL HCI (0.155M)/pepsin 
(4000 U/g substrate), 1 mL of 
the protease of the invention 



7mL NaOH (1.1M) 



5 mL NaHCQ 3 (1 M) / pancreatin 



PH 



3.0 



6.8 



6.8 



Temperature 



15°C 



15°C 



15°C 



Time 
course 



t=0 min 



t=6 hours 



t=7 hours 



Simulated digestion 
phase 



Gastric digestion 



Intestinal digestion 



Intestinal digestion 
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(8 mg /g diet) 










Terminate incubation 


7.0 


15 e C 


t=24 
hours 





Conditions 
Substrate: 
PH: 
HCI: 
' Pepsin: 
Pancreatin: 
Temperature: 15°C 
Replicates: 5 



10 g extruded SBM 

3.0 stomach step/ 6.8-7.0 intestinal step 
0.155 M.for 6 hours 
4000 U/g diet for 6 hours 
8 mg/g diet for 17 hours 



Solutions 
1.1 M NaOH 

0. 155 M HCI / pepsin (4000 U/g diet) 

•1 MNaHC0 3 containing 16 mgpancreatin/mL • • • » 

125 mM NaAc-buffer, pH 6.0 

Experimental procedure fnr aqua in vitm mnrid 

The experimental produce was according to the above outline. pH was measured at time 

1 . 5. 8 and 23 hours. Incubations were terminated after 24 hours and samples of 30 mL were 
removed and placed on ice before centrifugation (13000 x g, 10 min, 0'C). Supematants were 
removed and stored at-20'C. 

• Analysis 

All supematants were analysed using the OPA method (% degree of hydrolysis) and by AKTA 
HPLC to determine solubilised and digested protein (see monogastric example). 

Pre-treatment of in vitro supernatant* wi t n easy SPF m inmne 

Before analysis on AKTA HPLC supematants from the in vitro system were pretreated 
us.ng solid-phase sample purification. This was done to improve the chromatography and thereby 
prevent unstable elution profiles and baselines. The columns used for extraction were solid phase 
extraction columns (Chromabond EASY SPE Columns from Macherey-Nagel). 2 mL milliQ water 
was eluted through the columns by use of a vacuum chamber (vacuum 0.15 x 100 kPa) 
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Subsequently 3 mL in vitro sample was dispensed onto the column and eluted (vacuum 0 1 x 100 
kPa), the first V, mL of eluted sample was thrown away and a clean tube was placed beneath the 
column, then the rest of the sample was eluted and saved for further dilution. 

Results 

The results shown in Tables 16 and 17 below Indicate that the protease significantly 
increased Degree of hydrolysis and protein digestibility. 

' Table 16: Degree of Hydrops (DH) me^. ^dbvth^nPA m ^ o d . absQ> , rt ^ anH r ^ y „„ _ 



Enzyme 

(mg EP/kg diet) 



Blank 



Protease of the invention (50) 



Of total protein 



%DH 



21.30 



21.98 



SD 



0.52 



0.22 



Relative to blank 



%DH 



100.0 



103.2 



%CV 



2.42 



1.00 



utrerent .etters with.n the same colum n mdicate significant differences (1-way A NOVA Tukey- 
Kramer test, PO.05). SD = Standard Deviation: <KCV = Coefficient of Variance = (SD/mean value) 
x 100% 



lableJTLSolubilised and digested crude protein measured by AKTA HPLC, absolute 
values 



and relative 



Enzyme 
(mg EP/kg 
diet) * 
Blank 



N 



Of total protein 



%CP 
dig 

loo" 



SD 



2.2 



%CP 
sol 



89.9 



SD 



3.2 



Relative to blank 



%CP 
dig 



100.0 



%CV 



4.5 



%CP 
soi 



100.0 



%cv 



3.5 



Protease of 
the 

invention 
(50) 



1.1 



91.4 



1.5 



104.8 



2.1 



101.7 



(100) 



53.4 



0.4 



91.6 



1.0 



107.0 



h—z * I I I ' | | | ,,v JU '-u 0.7 1U1.9 ™ 1 1 

Drfferent .etters within the same column ind,cate significant differences (1-way ANOVA, Tukk 
Kramer test, P<0.05). SD = Standard Deviation. %CV = Coefficient of Variance = (SD/mean value) 
x 100%. 



101.9 



1.6 



Example 13. Fermentation and activity of 10R tail-variants TQ and TP with Savinase signal 
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Two of the S. subtilis strains of Example 5, strain 209 with the amino acid tail-variant 
TQ. and strain 211 with the tail-variant TP, together with B.subtilis Sav-10RS. were fermented on 
a rotary shaking table in 500 ml baffled Erlenmeyer flasks containing 100 ml TY supp.emented 
w,th 6 mg/l chloramphenicol. Twelve Erlenmeyer flasks for each of the three B. subtilis strains 
were fermented in parallel Four of the twelve Erlenmeyer flasks were incubated at 37"C (250 
rpm), four at 30'C (250 rpm), and the last four at 26'C (250 rpm). A sample was taken from each 
shake flask at day 1. 2 and 3 and analyzed for proteolytic activity. The results are shown in tables 
18 to 20 below. 

• As it can be seen from tables below, the effect of the 2 amino acid tails is a surprisingly 

high .mprovement on the yield of the protease, as measured by activity in the culture broth The 
effect of the 2 amino acid tails is comparable to the effect observed for Sav-10RS HV1 and Sav- 
10RS HV3 in Example 1. 
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Table 18: Relative proteolytic activities at 37°c. 





1 


• 2 


3 


10R synt-15 


1.0 . 


1.0 


1.0 


209 


7,0 


7,0 


6.0 


211 


7.2 


■ 7,7 


. 4,9 



Table 19: Relative proteolytic activities at 30'C. 





1 


2 


3 


10R synt-15 


1.0 


1.0 


1.0 


209 ~ 


4.5 


3.6 


4,9 


211 


4.0 


4.1 


5.0 



Table 20: Relative proteolytic activities at 26°C. 





1 


2 


3 


10R synt-15 


1.0 


1.0 


1.0 


209 


6.4 


4.3 


4.0 


211 


3.7 




- 4,2 . 



Example 14. Synthetic shuffled 10R-like protease tail-variants with signal 

A synthetic tail variant 10R protease encoding gene, denoted G-MAT-22, was 
constructed with a signal peptide, and the 8 amino acid Ctermina. tail of SEQ ID NO- 3 and 
introduced into a Bacillus host for expression. A surprisingly high yield of protease was achieved 
data not shown). The full coding DMA sequence of G-MAT-22 is shown in SEQ ID NO: 44 and 
the encoded pre-pro-protease is shown in SEQ ID NO: 45. The G-mat-22 protease is an alpha- 
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lytic protease-like enzyme (peptidase family S1E - old notation: S2A) This protease has . hinh 
temperature optimum (at P H 9) than the 10R protease, as shown in Figle l 

Example 15: Shuffled Pro-sequences of 10R-like Proteases 
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_ Rel. acivity 


10R 


1,0 


G-1.2 


2,9 


G-1.4 


1,4 


G-2.3 


1,6 


G-2.4 


3,4 


G-2.5 


3,0 


G-2.6 


4,2 


G-2.7 


1,6 



Table 21: Relative activity of 10R protease expressed with heterologous shuffled pro-peptides. 

Example 16. In vivo monogastric performance of tail-variant 10R-HV1 

This example describes a dose/response study with the four amino acid tail variant HV1 
of the 10R protease in the monogastric in vitro model using 10. 25, 50. and 100 mg EP/kg. and 
-using 10R protease as benchmark or control. The tail variant 1 0R-HV1 was constructed to have 4 
; am.no acids extra in the Otermjnus: QSAP (SEQ.ID NO: 5) as described above. 
In vitro conditions- 

4 g SBM, 6 g maize (premixed) 
3.0 stomach step/ 6.8-7.0 intestine step 
0.105 M for 1 .5 hours (i.e. 30 min HCI-substrate premixing) 
3000 U/g diet fori hour 
8 mg/g diet for 4 hours. 
40°C. 
5 



Substrate: 

PH: 

HCI: 

Pepsin: 

Pancreatin: 

Incubation: 

Replica: 



Enzvmes: 

10R protease: FFE-2003-00047; batch PPA21400; 154 mg EP/g product 

Freezedried 10R-HV1 FFE-2003-00077; 370 mg EP/g product 

Solution A: 1 QR. 100 mg t=p/ko diet- 

1 00 mg EP/kg diet ~1 mg EP/flask => 1 mg EP/mL 

(1 mg EP/mL * 10 mL)/ 154 mg EP/g product = 0.0649 g 

Prepare 10 mL Disolve 0.0649 g enzyme in 10 mL NaAc buffer. 



Solution C: 10R-HV1, 1Q 0 mQ FP/kf1 ritot . 
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100 mg EP/kg diet ~1 mg EP/flask =► 1 mg EP/mL 
(1 mg EP/mL * 20 mL)/ 370 mg EP/g product = 0.05405 g 
Prepare 20 mL: Disolve 0.0541 g enzyme in 20 mL NaAc buffer. 

Solution D: in R-HV1, 5 n FP^ 

50 mg EP/kg diet -0.50 mg EP/flask via 1 ml = 0.50 mg EP/ml 
Prepare 10 mL: Dilute C 2 times: 5 ml solution C + 5 mM25 m M NaAcbuffer 

; Solution E: 10R-HV1 ?s m n EP/ko riipf . 
25 mg EP/kg diet -0.25 mg EP/flask via 1 ml = 0.25 mg EP/ml 
Prepare 12 mL: Dilute C 4 times: 3 ml solution C ♦ 9 ml 125 mM NaAcbuffer 

Solution F- 10 R-HV1 IQmg fd^ ^ 
25 mg EP/kg diet -0.25 mg EP/flask via 1 ml = 0.25 mg EP/ml 
Prepare 10 mL: Dilute C 10 times: 1 ml so.ution C + 9 m. 125 mM NaAcbuffer 

* Substrates: 

Premix (40% SBM / 60% maize), FFS-2002-00121 

Chemicals: 

4.005 MHCI.AT-1-00061/29 
4.007 M NaOH, AT-1-00002/36 
25 Pancreatin FFE-2002-00052, 8xUSP 
Pepsin FFE-2003-00048, 471 U/mg 

NaOH 0.39 M- 
Prepare 500 mL: 
30 48.97 mL 3.982 M NaOH, fill with milliQ to 500 mL 

HCI1 solution n msiwi 
Prepare 2000 mL: 

52.43 mL of 4.005 HCI, fill with milliQ up to 2000 mL 
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HCL2 (HCI/pepsin) solution: 0.105 M conta ins annnn , , r ^:_,, m , 
Prepare 250 mL: '. 
Take out approx. 150 mL from the HCksolution art* <* i« 

HCI solution. "0-solut.on, add 3.18 g pepsin and fill up to 250 mL with the 

125 itiM N gAc-birffpr pt-i ft n- 

Prepay from a 2 U NaAcbuffer (KLu 04^7-2003/iab book 14169 p ,04, 
-> 12.5 mL 2 M NaAcbuffer. fill up to 20 0 mL with milliQ 

PasaiM Boiv^ m , M NaHrn con , a , nlnn t 

page 068. 9 St ° Ck ^P 3 ^ 0 " Ascribed in lab. book 14165 

Flow schemft' - 

ph * u. *zz suit: (or ^ is ^ and ia,er <,=1 •* ,h * 

NaOH is added and later ft-^s » Z " ' nleS " ni " (Wh 30 "*» 7 0 39 

centrtfugation Each seoem,.,™ • ./ * m ^Pension is sampled for 

No giass lubes Tte supeTlr "T^ ~* ^ ~ ,he «- 
h JL22. eSUPema ' an ' SareSP " ,i " too ^^'-<urmerana, y si S . ResuHs are shown 
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Table 22: 


Treatment < 


of samples in the monogastric in vitro model. 


Sample 


Enzyme 
Solution 


Enzyme 


PH 


Enzyme dose 
/kg diet: 


Pepsin 
U/g diet: 


Pancreatin 


1-5 


1 ml 
Buffer 


Blank 


3.0 


Omg EP 


3000 


mg/g diet: 
8.0 


6-10 


1 ml 
Solution A 


10R 

(FFE-2003-00047) 


3.0 


100mgEP 


3000 


8.0 


11 - 15 


1 ml 

Solution C 


10R-HV1 
(PPA22873) 


3.0j 


100 mg EP 


3000 


8.0 
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10 



15 



16-20 


1 ml 
Solution D 


10R-HV1 
(PPA22873) 


3.0 


50 mg EP 


3000 


8,0 


21-25 


1 ml 
Solution E 


10R-HV1 
(PPA22873) 


3.0 


25 mg EP 


3000 


8.0 


26-30 


1 ml 
Solution F 


10R-HV1 
(PPA22873) 


3.0 


10 mg EP 


3000 


8.0 



Soluble and Dioestihlo P~ trirr 

results are shown In Table 23. < 30 pep,ide colu ™)- The 

increase ^T^ZIZZT T ^ " ^ "** - 
°^«epro t e,nv^^ 

a^eTir .rrreV u r showin9 me ^ *— - «■ 

(1-ANOVA. TuKey. 95%, 0 " ^ *™ ^ , *' n " l »* dlffe ™<- 




The j: rr:Tr s „xr : : p r * e ,euei ot — 

50 and 25 mg EP/Ko _ _ ' ,nCreaSe » and ■**»«■ With a dose of 

20 significants lO^B^LtTl 7'°™ «~ ^ and *■* respectively and 

a CK/Kg diet the relative improvement was 0.5%. 

Degree of H ydrolysis- 



10423.020-DK 



10 



71 

The degree of hydrolysis (DH) was determined 
in Table 24. 



using the OPA method. Results are shown 



Enzyme [mgEP/kg] 


n 


Of total protein 
%DH SD 


Relative to blank 


Blank 


5 


25,89 


a 


0,43 


I 100,0 


a 


1,65 


10R (FFE-2003-00047) [1001 


5 


27,19 


be 


0,67 


105,0 


be 


2,46 


10FW100] ' 


5 


27,89 


r- 


0,36 


107,7 


c 


1,29 


10R HV i [50] 


5 


27,34 




0,57 


105,6 


be 


2,08 


IORhv, [25] 


5 


26,42 


fair 


0,57 


102,0 


ab 


2,16 


10R H vi[10] 


5 


25,52 


a 


0,96 


98,6 


a 


3,76 



=ooyine<ji-A method. Absolute as well as relative 

values are shovm. Different letters iholeate significant differences (1-wa» ANOVA, Tukey 95 %). 

'and i s Ta ~- 10R - HV1 *W ™ Hr compared to Blank. With the lower doses (50 
and 25 mg EP/kg die,) of the protease the implements ra nged from 5.6 . 2 . 0% . respecUvely in 

0RZ PreV :r ndln9S - " * *"* ^ 110 m9 E ™ 9 *« ~ - -~ ™e ong-na, 
10R [100 mg EP/kg diet) showed improvements of 5% relative to Blank 

The results of the HLPC AKTA analysis and the DH determinations cleariy show that 

Z 2 IT 3min0 "* <SE ° 10 5> ^ ,ai ' 10 1 ° R dMS - "» ■»*"»■"- - 
tne 1 0R protease to any significant extent. 
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CLAIMS 



lol™ T ePtide *** ^ Pr ° teaSe WhiCh P ol *> e P« de —P-es at .east three 

non-polar or uncharged po.ar amino acids within the .ast four amino adds of the Cterminus o^e 
polypeptide, and which polypeptide: terminus of the 

5 (a) comprises an amino acid sequence which is at least 70% identical to the amino acid 

sequence of the mature part of the polypeptide shown in SEQ ID NO: 28; SEQ ,D NO 
33; SEQ ,D NO: 37; SEQ ID NO: 41; SEQ ID NO: 43; or SEQ ,D NO- 45- 
(b) comprises an amino acid sequence which is at.least 70% identical to ihe amino acid 
Tno" I" I Lo^n T ^ ° f ^ P0 ' yPePtide 6nCOded by the PO'V-aeotlde in SEQ 
NO: 36; SEQ ID NO: 40; or SEQ ID NO: 44; 
(0 comprises a mature part which is a variant of the mature part of the polypeptide having 
the ammo ac,d sequence of SEQ ,D NO: 28; SEQ ID NO: 33; SEQ ID NO: 37; SEQ ID 
NO: 41; SEQ ID NO: 43; or SEQ ,D NO: 45 comprising a substitution, de,e«on 
15 extens,on. and/or insertion of one or more amino acids; 

(d) is an allelic variant of (a), (b), or (c); or 

(e) is a fragment of (a), (b), (c), or (d). 

w-dtype polypept.de M variant having one or more aminr>aoid(a) added to the Wermlnua 
compared t0 ^ wildtype , . ^ ^ ^ £££ * 

no^rrcjr ins ,o da * * wher * n me - - -» — — - <-> 

25 

: r r "* ,0 dairo 3 ' whare,n - ore - m ° re add * — » » - 

30 mam",he PO or P " de aCC ° rtin9 '° ^ * 0 " " "—'«*- «* «• ^eoted 

Too anZ ^ * QSHVQSAP ' QSAP ' ° P ' ^ T "' * .«■ »• LT. TO. 

ma.ural The P °' > " ,eP " d ! aCCMdln9 *° ^ " e ' almS 1 " 5 Whfch — » - - b— 

3s rr^zz : , srs zr - a r ent proteasa; ^ - - 

n i»2A or S1E protease; more preferably the pro-region is an artificial or 
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SEQ ID NO: 45. SEQ ID NO 46 sJZ S ' ° '° "° : 41 " SEQ 10 N0 - «. 

50, SEQ IDNO 5, «^ ,D "* * » "* * « ,D NO: 

5 

;r;r r„rr 1 ; 6 r when — - — — - 

heteratogous protease. <f ' pre,erab, >" he ^ro 1 o g oussec re «on signal p^ade „ deiived from . 

« 9. An isolate, pofynucleotHe encoding a po^pepbde as denned ,„ any Cciaims ,-8. 

IIT^T CXPreSSl0n "** ° r P °'~* ~ —a a palynuceodde as 

12. The recombinant host cell according ,o d ai m ,1 which is a 6ac*s ca». 

30 defined in claim 10. " e * PreSSi ° n or P*>l»""=leotida constnrct aa 

Z^T' " — > - - — 1 - * - method 

Plan, or a„La, TSiTST " ^ " ** " " « " *"~ 
35 potypap.de. and op,ona,,y (b) recovenng the polypepWa ' SUPema,a "' C ° m '" iSln8 ' he 
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10 



15 



20 



1* An an, m a, feed «■». a| ^ oto ^ ^ ^ ^ ^ ^ ^ _ ^ 



25 



(a) at least one fat-soluble vitamin, and/or 

(b) at least one water-soluble vitamin, and/or 

(c) at least one trace mineral. 



n any ot claims 1 - 8, or at least one feed additive of daim 16. 

(EC 3. 2 .,. 8); ■ZCZ^TT 7? ™ < 6C - 3.1.3.*,; „*anase 

phosphatase C (3.1.4.3,; ph 0 spho«pasets^ 3 E " l1 * ): ^^^ (EC 3.1.1.5); 
EC 3.2.1.6). spnoupase D (EC 3.1.4.4); and/or beta-glucanase (EC 3.2.1.4 or 



20 A method for using at least one polypeptide as defined in any of claims 1 8 ™™ • • 
•ncludmg the polypeptide^) in a detergent formulation. " ' mpnS ' n9 
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ABSTRACT 



P°lynuoleo«des expression vent™ Ctermmus of the polypeptide, encoding 

peptides ZZZZZZ r? " «- - » 

methods for pnoduetng se,d polypeptide, and methods for using the polypeptide 



Patent- on 
varemeerfestyretsen 

<H0> Lassen, soren Flensted Modtaget 
<120> i mproved proteases an(J methods fop produc . 

<130> 10423. 010-DK 
<160> 53 

<170> Patentln version 3.2 

<210> 1 

<211> 1062 

<212> DNA 

<213> Nocardiasis sp. nrrl 18262 
<220> 

tlV? "l^ c - f eature 

<222> (1)..C495) 

223> encodes the pro-region shown in positions -165 to -1 of SEQ id 



<220> 
< 22l> misc_feature 
<|f|> C496)..(l05l) 

NSf?f SS the mature ^ion shown in positions 1-188 of seq id 

<400> 1 

gctactggag cattacctca gtctcctaca rr<™„ 

ocattar^r , 9 tctc «aca cctgaagcag atgcagtatc gatgcaagaa 

gcattacaac gtgatcttga tcttacatca gctgaagctg aggaattact tgctgcacaa 
9 atacagcct ttgaagttga tgaagctgcc gctgaagcag ctggtgatgc a ta 
cagtattcg atactgaatc actcgaactt actgtactag tgaccgatgc agca g t 
9 aagctgttg aagccacagg tgcaggtaca gagctcgtat cttatggtat tgatgg 

taC3a9a9Ct taat9C39Ct ~ C ~ 
t 9 ; a " t9ataC — "cttg aaggctctgg agctgatgtt 

ta9Ca93C9C ^tccgcgg ttgaagtgac cacgtcagat 

t Ctat9CC9a ^™ ««■«* acacaatggg tggtcgctgc 

IT 39CCaC aaat9C39Ct 99a " aCCt9 9C " C ^- agctggacat 

.ccgcg tcggtacaca ggttactatc ggcaatggaa gaggtgtctt t gag lag 

^ r t9at9C 39a " taC9t ™ -tactaac 

Itl t99C " atat 9CaaCt9ta9 Ca " tCaCaa -aagcacct 

a t CtCa " 9tC9 9 ™* -cattcaa . 

9 ctaga ggtc agagcgtgag ^^^^ ggtaccgtaa 

g gcag aaccaggtga ctctggaggt tcatatatca gcggtacgca ag gca gg 

a t? r atCC " taa " 9ta " -Lag g 

acaccgatgg tgaactcttg gg gagttaga ctccgtacat aa . 9 
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<210> 2 10423. 020-DK.ST25.txt 

<211> 1143 

<212> DNA 

<213> Artificial sequence 

<220> 
<223> 



<400> 2 



*10^Se. S R p?fT B ( S2rS 1 a."J«H«9 a S2A protease denoted 
sequence of aWolo£ur p ™ease?la??nase? 1de enC<,d1 " 9 



atgaagaaac cgttggggaa aattgtcgca agcaccgcac tactcatttc tgttgctttt 
agttcatcga tcgcattggc tgctactgga gcattacctc agtctcctac aL aa 
9 atgcagtat cgatgcaaga agcattacaa cgtgatcttg atcttacatc agctgaagct 
.aggaattac ttgctgcaca agatacagcc tttgaagttg atgaagctgc cgctgaa gca 
9«ggtgatg catatggtgg tttagtattc gatactgaat cactcgaact tactgtacta 
9t accgatg cagcagctgt tgaagctgtt gaagccacag gtgcaggtat agagctcgta 
tcttatggta ttgatggatt agatgagatc gtacaagagc ttaatgcagc tgatgccgtt 
ccaggtgtag ttggatggta tcctgatgta gtaggtgata ctgttgtctt agaagttctt 
9aaggttctg gagctgatgt ttctggactt ttagcagacg caggagtcga tgcatccgcg 

„ T 9tca9a tca9C " 9M ctctatgcca m ™ 

tacacaatgg gtggtcgctg cagcgtagga tttgcagcca caaatgcagc tggacaacct 
Mcttcgtga cagctggaca ttgcggccgc gtcggtacac aggttactat cggcaatgga 
agaggtgtct ttgagcaaag cgtatttccc gggaatgatg ctgccttcgt 
c aacttta cgcttactaa cttagtatct agatacaaca ttggcggata tgcaa tgta 
ggtcaca atcaagcacc tattggctct agcgtctgcc gctcagggtc gactaca ga 
tggcattgtg gaaccattca agctagaggt cagagcgtga gctatcctga aggtaccgta 
acgaacatga ctcgtacgac tgtatgtgca gaaccaggtg actctggagg ttcatatatc 

ZZZl aa9C9CM " C9ttaCCC " ^Mctgtag gacaggtggc 

acaacgttct accaggaagt gaca.cgatg gtgaactctt ggggagttag actccgtaca U40 

1143 

<210> 3 
<211> 8 
<212> prt 

<213> Artificial sequence 
<220> 

<223> c- terminal amino ar.rf t-^-ti 

invention. aCld tai1 expressed as fusion to protease of the 

<400> 3 

Gin ser wis val Gin Ser Ala Pro 
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<210> 4 
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<211> 24 10423.020-DK.ST25.txt 

<212> DNA 

<213> Artificial sequence 
<220> 

<400> 4 

caatcgcatg ttcaatccgc tcca 

24 

<210> 5 
<211> 4 
<212> prt 

<213> Artificial sequence 
<220> 

<223> iSSSS] arain ° acid tail ^pressed as fusion to protease of the 

<400> 5 

Gin ser Ala Pro 

<210> 6 

<211> 12 

<212> DNA 

<213> Artificial sequence 
<220> 

<400> 6 
caatcggctc ct 

12 

<210> 7 
<211> 2 
<212> prt 

<213> Artificial sequence 
<220> 

<223> SSiS! amiri0 aCid tail « ^sion to protease of the 

<400> 7 

Gin Pro 
1 

<210> 8 

<211> 6 

<212> dna 

<213> Artificial sequence 
<220> 

<223> ?S]?r{rS^ea e Se CO S n ? h l fflj ■*■» add tail expressed as 

<400> 8 
caacca 
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<210> 9 10423.020-DK.ST25.txt 
<211> 1 
<212> prt 

<213> Artificial sequence 
<220> 

<223> fSSSS? amin ° aCid "" expressed as fusion to protease of the 

<400> 9 

Pro 
1 

<210> 10 

<211> 3 

<212> ONA 

<213> Artificial sequence 
<220> 

<223> fS]C C Me»? h e a fiSSfi£ •«* «« expressed as 

<400> 10 
cca 



<210> 
<211> 
<212> 
<213> 

<220> 
<223> 



11 
45 
DNA 

Artificial sequence 
Primer #252639 



<400> 11 

catgtgcatg tgggtaccgc aacgttcgca gatgctgctg aagag 

<210> 12 

<211> 44 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #251992 

<400> 12 

catgtgcatg tggtcgaccg attatggagc ggattgaaca tgcg 

<210>. 13 

<211> 44 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #179541 

<400> 13 

gcgttgagac gcgcggccgc gagcgccgtt tggctgaatg atac 

<210> 14 
<211> 43 
<212> DNA 

<213> Artificial sequence 
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45. 



44 



44 



10423, 020-DK.ST25.txt 

<220> 

<223> Primer #179542 
<400> 14 

gcgttgagac agctcgagca gggaaaaatg gaaccgcttt ttc 

<210> 15 

<211> 64 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #179539 

<400> 15 

ccatttgatc agaattcact ggccgtcgtt ttacaaccat tgcggaaaat agtcataggc 
atcc 



<210> 16 

<211> 60 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #179540 

<400> 16 

ggatccagat ctggtacccg ggtctagagt cgacgcggcg gttcgcgtcc 

<210> 17 

<211> 37 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #179154 

<400> 17 

gttgtaaaac gacggccagt gaattctgat caaatgg 

<210> 18 

<211> 37 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #179153 

<400> 18 

ccgcgtcgac actagacacg ggtacctgat ctagatc 

<210> 19 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #317 
<400> 19 

tggcgcaatc ggtaccatgg gg 
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10423.020-DK.ST2S.txt 

<210> 20 
<211> 40 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #139 NotI 
<400> 20 

catgtgcatg cggccgcatt aacgcgttgc cgcttctgcg 

<210> 21 
<211> 7443 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> sequence of plasmid P MB1508 
<400> 21 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc 
attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat 
tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt 
tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt cgataaaagt gctttttttg 
ttgcaattga agaattatta atgttaagct taattaaaga taatatcttt gaattgtaac 
gcccctcaaa agtaagaact acaaaaaaag aatacgttat atagaaatat gtttgaacct 
tcttcagatt acaaatatat tcggacggac tctacctcaa atgcttatct aactatagaa 
tgacatacaa gcacaacctt gaaaatttga aaatataact accaatgaac ttgttcatgt 
gaattatcgc tgtatttaat tttctcaatt caatatataa tatgccaata cattgttaca 
agtagaaatt aagacaccct tgatagcctt actataccta acatgatgta gtattaaatg 
aatatgtaaa tatatttatg ataagaagcg acttatttat aatcattaca tatttttcta 
ttggaatgat taagattcca atagaatagt gtataaatta tttatcttga aaggagggat 
gcctaaaaac gaagaacatt aaaaacatat atttgcaccg tctaatggat ttatgaaaaa 
tcattttatc agtttgaaaa ttatgtatta tggagctctg aaaaaaagga gaggataaag 
aatgaagaaa ccgttgggga aaattgtcgc aagcaccgca ctactcattt ctgttgcttt 
tagttcatcg atcgcatcgg ctgctgaaga agcaaaagaa aaatatttaa ttggctttaa 
tgagcaggaa gctgtcagtg agtttgtaga acaagtagag gcaaatgacg aggtcgccat 
tctctctgag gaagaggaag tcgaaattga attgcttcat gaatttgaaa cgattcctgt 
tttatccgtt gagttaagcc cagaagatgt ggacgcgctt gaactcgatc cagcgatttc 
ttatattgaa gaggatgcag aagtaacgac aatggcgcaa tcggtaccat ggggtatatc 
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10423.020-DK.ST25.txt 
aac g c g ttaa tcc S cggata tatagcggcc gcagatctgg gaccaataat aat g acta g a K40 
aa a tgaagattgt tcacgaMtt aa9gMc9M tattggataa 

ttttaaaata tatatttatg ttaca gt aat a« g ,ctttt aaaaaa gg at t g a«cta»t 
g aa g .aa 9 ca g acaagtaag cctcctaaat tcactttag, taaaaattta gg aggcatat 
caaatgaact ttaataaaat tgatttagac aattggaaga g aaaa gag at atttaatcat 
tattt g aacc aacaaac g ac tttta g tata accaca 9 aaa t tg ata»a g t g ttttatac 
csaaacataa aacaa 9 aa gg atataaattt taccctcpcat ttattttctt a 9 tgacaa 99 
9t 9 ataaact caaataca 9 c ttttagaact ggttacaata gcgacggaga gttaggtta, 
t 99 gataa 9 t tagagccact ttatacaatt ttt 9 at 99 t 9 tatctaaaac attctct gg t 
atttggactc ctgtaaagaa tgacttcaaa gagttttatg atttatacct ttctgatgra 
gagaaatata atggttcggg gaaattgttt cccaaaacac ctatacctga aaat g ctttt 
tcmtcta «at«ca t g 9 acttcattt aetata acttaaatat caataataat 
a 9 taa«acc ttctacccat tattacagca gg aaaattca tt,ataaa 9g taattcaata 
tatttacege tatctttaca ggtacatcat tctgtttgtg at 99 ttatca tgcaggattg 
tttatgaact ctattcagga att g tca 9 at a gg cctaat g act 9g ctttt ataatatgag 
ataatgeega ctgtactttt tacagteggt tttctaacga tacattaata gg tac g aaaa 
agcaactttt «tgcg Ct ta aaaccagtca taccaataac ttaaggg.aa cta g cctc g c 
c 9g aaa 9 a 9 c gaaaatgect cacatttgtg ccacctaaaa aggagegatt tacatatgag 
"atgcagtt tgtagaatgc aaaaagtgaa atca 9 ctgga ctaaaa gggg C c 9 ca gag ta 
g aat S9 aaaa gg 9 gatcgga aaacaa g tat ata ggagg a g acctatttat g9 cttca g aa 
aaagacgcag g aaaaca g tc a 9 ca g taaa g cttgttccat tgettattae tgtcgctgtg 
ggactaatca tctggtttat «ccgctccg tccg 9 actt g aacctaaa g c tt 99 cattt g 
«t g cgat« tt g tc 0 caac aattaregge tttatctcca agcccttgcc aat gg gtgca 
attgeaattt ttgcatt gg c g gttactgca ctaactgga, cactatcaat t 9 a gg ataca 
ttaageggat tegggaataa gaccatttgg ettategtta tegcattett tatttccegg 
ggatttatca aaaceggtet eggtgegaga atttegtatg tattegttea gaaattcgga 
aaaaaaaccc ttg 9 ac«tc ttattcactg ctattca g t g atttaatact ttcacctgct 
attccaagta ataeggegeg t gcaggag g c attatatttc ctattatcag atcattatcc 3060 
gaaacattcg gatcaagccc ggcaaatgga acagagagaa aaatcggtgc attcttatta 3120 
aaaaccggtt ttca ggggaa tctgatcaca tetgetatgt tcctgacagc gatggcggcg 3380 
aaccc 9 ct g a ttgecaaget ggeccatgat gtcgcagggg tggacttaac atggacaagc 3240 
gggcaattg cccgartgt acegggaett gtaagcttaa tcatcacgc. gcttgtgatt 3300 
tacaaactgt atccgccgga aatcaa ag aa acaceggatg eggegaaaat cgcaacagaa 3360 
aaactgaaag aaatgggacc gttcaaaaaa tcggagnn 
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1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 
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2400 

2460 
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2700 
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2940 

3000 
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ttmct* ttttggcggc a g cttcaa« tc g ac 9 ctac cacaacc 9 ca 

t99CC8 "« c« g99 at g a t atcaa g aaa 

r W9C9 Ctt " 9ataC 9CtMCttM tt 9 tcat 9 ct c 9 ccaacttc 

t 9 aa taggcatggt gtcttggttc a9tMt9ra ^^^^^ 

«ctct, 99 , tt 9t99 catt catcatttta »tt 9 tt 9t9 t attattactc tcactattt! 
*tt 9 caa g t g c 9 aca 9 c C ca catca 9 t 9 c 9 at 9 tattca 9 cattttt 99 c t 9 rc 9 tc 9 t 9 
9«9c 999 c 9 cacc 9 cc g ct ttta 9 ca 9 c 9 ct gagC ctc 9 c 9 tt C atca 9 caacct g ttc 
« caac 9 a ctca«ac 99 ttc t 99 a 9 c 9 9 ctcc 99 tct tcttc^c a 99 ctaca« 
- 9 ca, 99 ca aat 9gtgg tc catc 99 attt atcct 9 tc 9 a ttgttcatat catc 9 tat 99 
ctt gt9 atc 9 9 c 99 attat 9 9 t 99 aa, 9 ta cta 9g aatat 99 ta 9 aaa 9 a aaaa gg ca g , 
c 9 c S9t ct 9 c cttmttat t«cactcct tc 9 taa 9 aaa a tggattttg aaaaat 9 a 9 a 

aatlT 9 t9MaaatM " t9atCta9 9 " 9aaa " a ~- 
aa 9 c 99 «cc atttt.ccct 9 caaacaaaa ataat 9999 c t 9 att 9 c 99 c tct^tc 

tt 9 tcatt 9 g t gtgc t g ac cattac g «a 9 cc gt tca g c atacaca 999 ag „ cggaga 
ca^ca^c a 9 ct 99 c 99 t t„aac ggcg a 9 aacca«t c«at atg cc 9 cc 9g ttaaa 
3a 9 ctcatt 9 a 9a9aaaa9a c 99aC at 9 c 9 9 ctca 9 ac 9 c aa 9a99t cat t 9 a a caaat 9 
aaa 9 aaca 9 a ct ggt9 c 9 « t 9 ccatttat 9 tttt 9 aac 9 aaaaa 99 a 9 a ca«c 9Mg c 
9cctc, 99 aa aaa 9 c 99 att aaa 9 aaact 9 gag <: 9 c ag ca 9 a 9 aaatttt 9 ttt 99 c 9g t 
tc 9 cat 9 ttt ctgaaacaaa a 9 c 99 at 9g a c 9 aa 9agt9 a t C a 9 a 9ggag c 9 c 9 cc 9 att 
a«aaa 9 aac a 9 a aggg ata ca 9 ccaa g , g atc 99 ca 9 c g tg tct 9 tt 9 a ttttct g caa 
ac 99 a 9 aca g a g caaa 9 car caaaaa 9 ca C tt9 a 9 aaatt t g a gtgtg at t g ct gtg ctt 
9 "" 9Ct9C tC9S " ttat =t 9 ct 99 c 9 a aaa 9 catca 9 aa agg atac g 

«c ggg «t 9 aacc 9 cat 9 a 9 atc 9 c 99 ct ctatatc g t 9 a 9 a 9g aac g c aat g cttttc 
g c g attc g a g aa 999 a rat t 9 ccaccaat c 9 t g a ag9 c 9 tc g tcaccat 9 at 9 aac9ta 
tc 99 c g9 cc 9 ag at 9 ct 9 aa 9 ct 9 ccc 9 a g cct g t 9 atcc atcttcctat ag at 9 acgtc 
at 9 cc ggg a 9 ca g99 ct 9 at 9 tct g t 9 ctt saaaaa^a,, aa atg ct 9 cc 9 aacca g9 aa 
9 taa gcg tca a c g atcaa g t g tttattatc aatac g aaa 9 tgatgaatca a 9g c g99 ca 9 
•cram. tt 9 tc gt ca 9 cttca 9g9 a 9 aaaaca 9ag c t gaagaagct 9 atc g acaca 
tt 9 aca gagg ttc 9 caaata «ca 9 a 99 at ctca 999 c g c a g actcat 9 a attttcaaat 
aa 9 ct«a tg cg a«tta 99 9c t g c 9 tc 9 a cornea t 9 caa g «t g g c 9 taatcat 
99tcata g ct gt «cct 9t9 t 9 aaatt g « atcc g ctcac aattccacac aacatac gag 
« 99 aa 9 cat aa a9 t 9 taaa 9 cct 99g9 t g cctaa tg a g t 9 a 9 ctaactc acattaatt g 
c g tt gcg ctc act 9 ccc 9 ct ttcca gt c 99 ga aacct g tc 9tg cc ag ct 9 cattaat g aa 
tc gg ccaac g c g c 9g9gaga ggcgg ttt gc g tatt 9g9 c g ctcttccgct tcctc g ctca 
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3480 

3540 

3600 

3660. 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 . 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 



10423.020-DK.ST25.txt 
ctsactcgct 9 c g ctc M tc 9 «c M « 9 c 9g c 9 a g c gg t atca 9 ctcac tcaa aggcg9 5S20 
taa t ac g9 « atccacagaa tcagggg „ a lcgcaggMa *° 

a 9 ca a a ag9 c ca 9g aacc 9 t aaaaaggccg cg „ gctgg£ » J 

cccct gacga g c a tc acaa a aatcga c gc t c aagtcaga9 9t99cgaaac ccgacaggac 
ata aag ata cca 99 c 9 ttt ccccct 9gaa 9ctccctcgt gcg „ ctcct cc 

7 3CttaC CMata " t£l "ccttc ggg aa 9 c gtgg c 9 ctttctcata 

9ctcac g « g ta gg tatctc ag ttc g9tg t aggt c 9 ttc 9 ctccaa g ct 9 gg c Wgc 
ac 9aa ccccc cgttcagcc£ gaccgctgcg ccttatccgg taaraKst ^ 

accc 99 taa 9 aca c ga ctta tc g ccact g9 c ag ca 9 ccac t 9g taaca g9 a « agcagag 
c 9 a 99 tat 9 t a9g c 99 t 9 ct aca 9 a 9 ttct tgaagtggtg gcctaactac 
9M99 aca 9 t at« ggtatc tgcgctctgc tgaagccagt taccttcgga aM 

9t a9 ctctt 9 at cc gg c aaa ca a a ccacc 9 c tgstagcgg tggttttttt gtttgc 
a g ca ga «ac 9cgcagaaaa aaaggatctc aagMgatcc tttgatcw tct g 
ct ga c g ctc a 9tg9aacgaa aac?cacgtt aagggatttt ggtcatsaga ttatMaaaa 

99 atc«cac cta 9 atcctt t« M tt aaa aatg a ag ttt t aaa tc aa tc t aa a 9 t a t at 
at gagtaaa c ttwtct^c ag ttacc aat 9 ctt aa tc ag t gaggca cct a tctc ag c ga 
t« 9 tct a « tc9tccatcc ata 9 tt 9 c<=t g actccc Cgt cgtg , agata actacgatac 
999a gg9 ctt accatct 99c CC ca gt9 ct 9 caat gata cc 9 c 9agac cc a c g ctc a cc 99 
cacc 99 attt a tc a9 caat a aa cc ag cca 9 cc 9g aa g99 c c 9 a 9 c 9 ca g a a 9 t 9g tcct 9 
caactttatc c gcC tccatc castctatta att g tt 9 cc 9 9g aa g cta g a g taa g ta g tt 
c 9 cc ag tt aa tastttgcsc a acg « 9ttg ccattgctac aggMtcgtg 
c 9 tc g ttt 99 tatggcttca ttcagctccg gttcccMcg atcaaggcga gttaca 

ccccc a t 9 tt g t g caaaaaa 9 c ggt ta 9 ct ccttc g9 tcc tc<: 9 atc 9 tt 9 tc agaagta 
a9tt 99 cc 9 c ag t g tt a t„ ctc a t gg « a t g9 ca g cact 9 cat aa ttct cttact g tc a 
t^tccst aa9 at g cttt tct 9 t 3 act g st^ctc aa cc aagt c a ttct gagaat 
a 9 t 9 t a t 9 c 9 9 c 9a cc gagt tg ctctt g cc c g9 c 9 tc a at a c g9gata at acc 9 c 9 cc a c 
ata 9 c agaa c ttt aaaagtg «catcatt 9 9 aaa a c 9 ttc « Cgg99 c ga aaactctcaa 
gg atcttacc 9 ct 9 tt gaga tcc ag ttc 9 a t g , aac cc a c tc 9 t 9 c a ccc aact g atctt 
ca S ca«cttt tactttcacc a g c gt ttct 9 gg t 9ag c aaa aac aggaagg c aa aat 9 cc 9 
c aaaa a aggg aataagggcg acacggaaat gttgaatact catactcttc ctt „ tcaat 
att a tt 9aag c a ttt a tc ag gg ttatt g tc tc a t 9agcgg ata catattt gaatgtattt 
a 9 aaaaa ta a acaaata 9g9 g ttcc 9 c 9 ca catttcccc g aaaagtgcca cct g ac g r« 
aa gaaa c C at ta TO tcat g aca tt aa cct ataaaaata9 gcgtatcacg aggccctttc 



5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 
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10423.020-DK.ST25.txt 

<210> 22 
<211> 5718 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> sequence of MB1510 genomic integration region 
<400> 22 

12221 a t99 " 9aat9 atacaaca9t n "" tc " ™- ««— » • 

acgaagaagc aaggattcce ctcgcttctc atttgtcct. tttattatac acttttttaa 120 

gcacatcttt ggcgcttgtt tcactagact tgatgcctct oaa««„~ 

wyaL 9 tcx « gaatcttgtc caagtgtcac 180 
ggtccgcatc atagacttgt ccatttttca ccgctttgao a«t« c „„ . 

„„ 9«xtgag atttttccag agcgggrtcg 240 

ttttccactc atctacaatg gttttgcctt cgttggctga gatgaacaaa atatcaggat 300 
a t ttgct caattgctca >9gctgacct cttgataggc ^ » 

g a a 9cc tagcatttta M9atttctc cgtcatagga tgat9M8ta 

C T tctt9caaC9 CC9a9aac9a t9tt9C99tt « «. 

a" " ""^^ CMCM9C " 540 

gc tttagcaatg gtcg taaagc tgtcgat^gt ttcgtcatat gtcgcttcac 

ggctttttaa ttcaatcgtc ggggcgattt ttttcagctg tttataaatg tttttatggc 660 
Octcagcgtc agcgatgatt aaatcaggct tcaag g aact gatgacctca agattggg tt 

: 9cc ""- ■*•»•*« ^ll 7 

cttttttgtt gtcatctgcg atgcccaccg gcgtaatgcc gagattgtga acggcatcca 840 
= , ctcaagcaca accacccgc , taggtgtgcc gcttactgtc - 

g ct aCtCt9 9MtCCt " 9 aCtC9Ctt " S « 3 "^ « 

ac a a9 : c99at acaat9a "< ««— — ^..^ 1020 

™ C9 " t9tCat tCttCC "" 1080 

tcattatcac atgtaacact ataatagcat ggcttatcat gtcaatattt ttttagtaaa 1140 

gaaagctgcg tttttactgc tttctcatg, aagcatcatc agacacaaat aag t g gtatg ^ 

cagcgttacc gtgtcttcga gacaaaaacg catgggcgtt ggctttagag gtttcgaaca 1260 

tatcagcagt gacataagga aggagagtgc tgagataacc ggacaatttc ttttctattt 1320 
cat t gttag tgCMattca atgtcgccga tattcatgat aatcgagaM 

atcgatatg aaaatgttcc tcggcaaaaa ccgcaagctc gtgaattcct g gtg aacatc 1440 
cggcacgctt atggaaaatc tgtttgacta aatcactcac aatccaagca ttgtattgct 1500 

tcatoTl a tT" 9C atta9aCata C " CCt9CtC «° 
tcatggtcgt gtgctccgtg cagcggcttc tccttaattt tgatttttct gaaaataggt 1620 

ccg ttccta tcactttacc atggacggaa aacaaatagc tactaccatt cctcctgttt 1680 
ttctcttcaa tgttctggaa tctgtttcag gtacagacga tcgg g tatga aagaaatata 1740 
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10423. 020-OK.ST25.txt 
9 aaa=cat 9 a aggaggaata tc 9 acat 9 aa acca g tt gt a aaa 9 a 9 tata caaat 9 ac 9 a !800 
aca 9 c rat9 aaagatgtag aggaattgca gMaatgs , t flttgcgiajg ^ 1800 

Zl ^ CaC9aC9 " S ^« ac ccL 

9 atc 99 a 9 cc a aagaaacag BtlteMge , cgcggtggga ^ 

a 9 ac 9agctc cgcaataaaa ttcacgMat cggtttMct ^ c 8 

t 9aaaaac9 c t ta9at9aag gaaaagtgct tctctttgtg acagatM aaM 

i c t«r taaa9caa " aaaaaaraa aa " raat9 

ta "" " t9 " attt t9Ca9Mt9C C8CMta99a "ttc 99tt 

a 9 r c tcaattt9ct attatatttt t9t9ataaat ts — 

t agaaaa t 99 g9g t a c atag t ggatgaaaa aagtgatgtt agctacggct ttgtttttag 

::::r cc a9ct " csc9 aac9ca9ct9 a -— 

at 9g c tgggg cg c g t a ctc g accg9 c a c 9a ca 9g c 9gatc aaaagcatcc 
9 atacc 9t c agcaacaga aaccagcttg tctcggcatt agggaaggM 
c 9 cc catttatatc aagggaacga ttgacatgaa a 

c ctt 99 cct aaatgacta t aaagatccgg agtatgat « ggacaMtat 9 9 

at9a tcct ag c a c atgggg c aaaaaagagc cgtcgggaac 9cgagagcac 
ctc« agaa aaaccaaaaa gcacgggtca tggtggatat ccctgcMK 

iiitr 9actaac9ct aaa9tcst " 9a " aaac " 

ac™ T Ca " 9aa ttCCa " at9 »««— « 3 « gtaaa 

^ 9t9MttCt9 c 9aa9 c 9aac a « tgatttt 

" at " tCt atCtt "" a 0«~c«« tt g tcct ata aacta ttt ag 

c ag c« aata gatttattga ataggtcatt taagttgagc atattagagg 9 

9 a 9aa « a tttgaagaac ccgagatc „ gatcaggcac cgcaacgttc « « 

« 9aagagat tattaaaaag ctgaaagcaa aaggctatca attagtaact 

r 9t 9M9aa9Ca9 a9a " Cta " — cca„« g c 

« ,99aa9Maa tata999aaa at99ta " t9 "«"™ 
»ca a t a tc a tat 9 tatc ac a tt 9aaagga ggggC(:tgct gtccagactg t 

aaa aataagg aataaagggg ggttgacatt att „ lctga tatgtatMt a ^ 
ta agaaaatg gaggggccct cgaaacgtaa gatgaaacct tagataaaag tgc „ ttttt 
9 tt 9caa tt 9 aagaattatt aatgttaagc tta „ taaag MMtatctt 
c 9 ccc ctcaa aagtMgaac tacaaaaMa gaatacstta tatasaaMa 

"7 at a aatgc « atc taactataga 

atgacataca anrara^rr* * 3 



1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 



3660 



t ga c ata c a ag c acaa cct t 9aaaatttg aaaatataac ta a » 

tgaa « atcg ctgtatttaa ttttctcaat tca „„ atiatatgccaat acattgttac 9 3 3 ™ 



3840 
3900 



10423.020-DK.ST25.txt 
aagtagaaat taagacaccc ttgatagcct tactatacct aacatgatgt agtattaaat 

gaatatgtaa atatatttat gataagaagc gacttattta taatcattac atatttttct „ w 

attggaatga ttaagattcc aatagaatag tgtataaatt atttatcttg aaaggaggga 3960 

tgcctaaaaa cgaagaacat taaaaacata tatttgcacc gtctaatgga tttatgaaaa 4020 

atcattttat cagtttgaaa attatgtatt atggagctct gaaaaaaagg agaggataaa 4080 

gagaaaaggg gatcggaaaa caagtatata ggaggagacc tatttatggc ttcagaaaaa 4140 

aacgcaggaa aacagtcagc agtaaagctt gttccattgc ttattactgt cgctgtggga 4200 

ctaatcatct ggtttattcc cgctccgtcc ggacttgaac ctaaagcttg gcatttgttt 4260 

gcgatttttg tcgcaacaat tatcggcttt atctccaagc ccttgccaat gggtgcaatt 4320 

gcaatttttg cattggcggt tactgcacta actggaacac tatcaattga ggatacatta 4380 

agcggattcg ggaataagac catttggctt atcgttatcg cattctttat ttcccgggga 4440 

tttatcaaaa ccggtctcgg tgcgagaatt tcgtatgtat tcgttcagaa attcggaaaa 4500 

aaaacccttg gactttctta ttcactgcta ttcagtgatt taatactttc acctgctatt 4560 

ccaagtaata cggcgcgtgc aggaggcatt atatttccta ttatcagatc attatccgaa 4620 

acattcggat caagcccggc aaatggaaca gagagaaaaa tcggtgcatt cttattaaaa 4680 

accggttttc aggggaatct gatcaca.ct gctatgttcc tgacagcgat ggcggcgaac 4740 

ccgctgattg ccaagctggc ccatgatgtc gcaggggtgg acttaacatg gacaagctgg 4800 

gcaattgccg cgattgtacc gggacttgta agcttaatca tcacgccgct tgtgatttac 4860 

aaactgtatc cgccggaaat caaagaaaca ccggatgcgg cgaaaatcgc aacagaaaaa 4920 

ctgaaagaaa tgggaccgtt caaaaaatcg gagctttcca tggttatcgt gtttcttttg 4980 

gtgcttgtgc tgtggatttt tggcggcagc ttcaacatcg acgctaccac aaccgcattg 5040 

atcggtttgg ccgttctctt attatcacaa gttctgactt gggatgatat caagaaagaa 5100 

cagggcgctt gggatacgct cacrtggttt gcggcgcttg tcatgctcgc caacttcttg 5160 

aatgaattag gcatggtgtc ttggttcagt aatgccatga aatcatccgt atcagggttc 5220 

tcttggattg tggcattcat cattttaatt gttgtgtatt attactctca ctatttcttt 5280 

9caagtgcga cagcccacat cagtgcgatg tattcagcat ttttggctgt cgtcgtggca 5340 

gcgggcgcac cgccgctttt agcagcgctg agcctcgcgt tcatcagcaa cctgttcggg 5400 

tcaacgactc actacggttc tggagcggct ccggtcttct tcggagcagg ctacatcccg 5460 

caaggcaaat ggtggtccat cggatttatc ctgtcgattg ttcatatcat cgtatggctt 5520 

gtgatcggcg gattatggtg gaaagtacta ggaatatggt agaaagaaaa aggcagacgc 5580 

9 9 tctgcctt tttttatttt cactccttcg taagaaaatg gattttgaaa aatgagaaaa 5640 

ttccctgtga aaaatggtat gatctaggta gaaaggacgg ctggtgctgt ggtgaaaaag 5700 
cggttccatt tttccctg 



5718 
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<210> 23 10423. 020-DK.ST25.txt 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1605 

<400> 23 

gacggccagt gaattcgata aaagtgc 

<210> 24 

<211> 42 

<212> DNA 

<213> Artificial sequence 

. <220> 
<223> Primer 1606 

<220> 

<221> misc_feature 

<||2> C13)., ( 13) 

<223> n is a, c, g, or t 

<220> 

535 c_feature 
<222> (16)..C16) 
<223> n 1S a. c, g, or t 

<400> 24 

ccagatctct atnktnktgt acggagtcta actccccaag ag 

<210> 25 
<211> 1U2 
<212> DNA 

<213> Nocardiopsis dassonvillei osm 43235 
<400> 25 

•cntagtt catcgatcgc atcggctgct ccggcccccg tcccccagac ccccgtcgcc 
gacgacagcg ccgccagcat gaccgaggcg ctcaagcgcg acctcgacct cacctcggcc 
eaggccgagg agcttctctc ggcgcaggaa gccgccatcg agaccgacgc cgaggccacc 
saggccgcgg.gcgaggcct. cggcggctca ctgttcgaca ccgagaccct cgaactcacc 
9tgctgg tc , ccgacgcctc cgccgtcgag gcggtcga9g ccaccMasc ta ^ mc 

gtcgtctccc acggcaccga gggcctgacc gaggtcgtgg aggacctcaa cggcgccgag 
gttcccgaga gcgtcctcgg ctggtacccg gacgtggaga gcgacaccgt cgtggtcgag 
Stgctggagg gctccgacgc cgacgtcgcc gccctgctcg ccgacgccgg tgtggactcc 
tcctcggtcc gggtggagga ggccgaggag gccccgcagg tctacgccga .atcatcggc 
McctHeet actacatggg cggccgctgc tccgtcggct tcgccgcgac caacagcgcc 
99tcagcccg gtttcgtcac cgccggccac tgcggcaccg tcggcaccgg cgtgaccatc 
ggcaacggca ccggcacctt ccagaactcg gtcttccccg gcaacgacgc cgccttcgtc 

ITT """"" CCt9aCCMC Ct99t " C ' C S «™ cggcogctac 
cagrcggtga ccggtaccag ccaggccccg gccggctcgg ccgtgtgccg ctccggctcc 
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27 



42 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



10423 . 020-DK. ST25 . txt 
accaccggct ggcactgcgg caccatccag gcccgcaacc agaccgtgcg ctacccgcag 
ggcaccgtct actcgctcac ccgcaccaac gtgtgcgccg agcccggcga ctccggcggt 
tcgttcatct ccggctcgca ggcccagggc gtcacctccg gcggctccgg caactgctcc 
9tcggcggca cgacctacta ccaggaggtc accccgatga tcaactcctg gggtgtcagg 
atccggacct aatcgcatgt tcaatccgct cc 

. <210> 26 

<211> 48 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1423 

<400> 26 

gcttttagtt catcgatcgc atcggctgct ccggcccccg tcccccag 

<210> 27 

<211> 45 

<212> DNA 

<213> Artificial sequence . 
<220> 

<223> Primer 1475 

<400> 27 

ggagcggatt gaacatgcga ttaggtccgg atcctgacac cccag 

<210> 28 

<211> 354 

<212> prt 

<213> Nocardiopsis dassonvillei dsm 43235 
<220> 

<221> propep 

<222> (1) . . (166) 

<220> 

tlV? "??t-peptide 
<222> (1675.. C354) 

<400> 28 

AH Ala Pro val Pro Gl„ ? T hr Pro va! Ala Asp. Asp ser Ala 

Al, Ser o Met Thr olu A la Leu uy s Ar 9 Asp Le „ Asp Leu nr ser 

-140 

Ala olu 5 Ala Glu Glu Leu Leu Ser Ala Gin Glu Ala Ala He Glu 

"° -125 

Thr Asp^Ala Glu Ala Thr Gl^ Ala Ala Gly Glu Ala^ Tyr Gly Gly 



900 
960 
1020 
1080 
1112 



48 



45 
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Ser _ L eu s . Phe Asp Thr Glu % ^VW^^teu Val Thr ASp 
Ala ser Ala val Glu Ala val Glu Ala Thr Gly A i a G l n Ala Thr val 



-ou _ 75 

val ser His G ly Thr G lu G l y Leu rhr Glu val val Glu Asp Leu ash 

. ~ 65 -60 

Gly Ala G lu yal Pro Glu Ser Val Leu Gly Trp Tyr Pro Aso val Glu 
ser Asp Thr val val val G lu val Leu G lu G ly ser Asg Ala Asp val 
Ala Ala Leu Leu Ala Asp Ala G ly val Asp Ser Ser ser val Arg val 



-15 



Glu G lu Ala Glu G lu Ala Pro G ln val Tyr Ala Asp He n e 6ly Gly 

Leu Ala Tyr Tyr Met G ly Gly Arg gs Ser val G ly Phe Ala Ala Thr 

Asn ser Ala G ly Gln Pro G ly Phe val Thr Ala G ly „i s cys G ly Thr 

val G jy Thr Gl y val Thr lie G ly Asn G ly Thr G Jy Thr Phe. G ln Asn 

||r val Phe Pro Gly Asn Asp Ala Ala Phe val Arg Gly T hr ser Asn 

65 70 

Phe Thr Leu Thr Asn Leu Val Ser Arg Ty_r Asn Ser Gly Gly Tyr Gln 

Ser val Thr Gly Thr ser Gln Ala Pro Ala Gly ser Ala val cys Arg 

95 100 . 

ser Gly ser Thr Thr Gly Trp His cys Gly Thr He Gln Ala Arg Asn 
Gln Thr val Arg Tyr Pro Gln Gly Thr val Tyr Ser Leu Thr Arg Thr 

A|n val cys Ala Glu Pro Gly Asp Ser Gly Gly Ser Phe lie Ser Gly 

145 --- 



150 



Ser Gln Ala Gln Gly val Thr s?r ci« ri*, <-i 

155 r Gly G IX Ser Gly Asn c ys Ser val 

160 165 
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«1y «ly Thr Thr Tyr Tyr „. i»X%,*Kf3 l . «n ser Trp 
u 175 180 

Gly val Arg He Arg Thr 
185 

<210> 29 
<211> 498 
<212> DNA 

<213> Nocardiopsis dassonvillei DSM 43235 
<400> 29 

gctccggccc ccgtccccca gacccccgtc gccgacgaca gcgccgccag catgaccgag 60 
gcgctcaagc gcgacctcga cctcacctcg gccgaggccg aggagcttct ctcggcgcag 120 
gaagccgcca tcgagaccga cgccgaggcc accgaggccg cgggcgaggc ctacggcggc 180 
tcactgttcg acaccgagac cctcgaactc accgtgctgg tcaccgacgc ctccgccgtc 240 
9aggcggtcg aggccaccgg agcccaggcc accgtcgtct cccacggcac cgagggcctg 300 
accgaggtcg tggaggacct caacggcgcc gaggttcccg agagcgtcct cggctggtac 360 
ccggacgtgg agagcgacac cgtcgtggtc gaggtgctgg agggctccga cgccgacgtc 420 
gccgccctgc tcgccgacgc cggtgtggac tcctcctcgg tccgggtgga ggaggccgag 480 
gaggccccgc aggtctac 4gg 

<210> 30 
<211> 166 
<212> PRT 

<213> Nocardiopsis dassonvillei DSM 43235 
<400> 30 

Ala Pro Ala Pro Val Pro Gin Thr Pro val Ala Asp Asp Ser Ala Ala 
5 10 15 

Ser Met Thr Glu Ala Leu Lys Arg Asp Leu Asp Leu Thr Ser Ala Glu 
20 25 30 

Ala Glu Glu Leu Leu ser Ala Gin Glu Ala Ala He Glu Thr Asp Ala 
3-> 40 45 

Glu Ala Thr Glu Ala Ala Gly Glu Ala Tyr Gly Gly ser Leu Phe Asp 

Thr Glu Thr Leu Glu Leu Thr val Leu Val Thr Asp Ala Ser Ala val 
" 70 75 80 

Glu Ala val Glu Ala Thr Gly Ala Gin Ala Thr val val Ser His Gly 
85 90 95 

Thr Glu Gly Leu Thr Glu val val Glu Asp Leu Asn Gly Ala Glu val 



110 
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Pro d u Ser val Leu Gly Trp j^S-2t«flS.« Asp nr val 
«al val „„ val Leu slu g v ser Asp Ala Asp ^ AU AU ^ ^ 

Ala asp A la 61y val Asg ser Ser Ser val Are val clu olu Ala Glu 

155 160 



Glu Ala pro Gin val Tyr 
165 

<210> 31 
<211> 1146 
<212> DNA 

<213> Artificial sequence 

<220> 
<223> 



^%?aSe S ^ U Si!8S d p?i t S s r e t Jl i ? r0 - r ?^on of SEQ id N0 : 29 fused 
construct: lORCproXiiSJf) tai1 " vari ant encoding gene; whole 



<400> 31 

C9tt "" M Mtt9tC9ra a9CaCC9MC t gt tgct«t 

a 9 ttcatc 9 a tc g ca t c gg c t 9 ctcc 99 cc cccotecccc ag accc« g t e 9 cc 9 ac 9 ac 

a 9 c 9 cc 9 «a 9 cat 9 acc 9 a flg c g ctca> 9 c 9 c g acc tcg acctcacctc . cc^ c 

:::i:r tctc " c9ca " aa9cc9cc at ~ 9 a <°— ~ * ~ 

t 1 T 9 " tCC9CC9t C9a " C " tC 9a " CCa " 9 ™«— < "«°»«c 
a a I r 999CCt 9aCC9a " tC 9t '~ 

LIT 9Ct99ta CCC99ac9t9 9a9a9C9aca 

«™I aC9 " 9aC9t C9 " 9CC " 9 ° tC9CC9aC9 " 9 ~ 
9tcc 999t99 a 99 a 99 cc 9 a 99 a 99 cccc 9 ca 99 tctat 9 ec 9 atatcat t 99 a 99 ccta 
c 8t acacaa tg9gtggtcg ctgcagcgta £ 

c t M c«c 9 t9 aca g ct gg aca« 9 c 99 c c.c^a caca 99 „ac taec^caat 
9S aa 9 a 99 t 9 tcttt 9 a 9 ca aa 9 c 9 tat« ccc 999 aat 9 at 9 c tg cctt c gtt a g a ggt 
™ t C : tta — — - aLJaa 

g r i r tcaa9c acctatt " c tcta9c9Kt 

9 9 a tgg catt 9 t gg aaccat tcaa 9 cta g a g9 tca gag c g tgag ctatcc tg aa gg tacc 

~ ac : t9actc9tac 9act9tat9t 9ca9aacca9 

a«a 9 c g9 ta c 9 caa gcg ca a 99cgttacc tcaggtggat 

acaT^ tCtaCCa " a a9t9a "" 9 a ~« ««~ ta g Lcc gt U40 

1146 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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<210> 33 

<211> 355 

<212> prt 

<213> Nocardiopsis Alba DSM 15647 
<220> 

<221> PROPEP 

<222> (l) . . (167) 

<220> 

<221> mat_peptide 
<222> (1685..C355) 

<400> 33 

Ala Thr ^ Pro Leu Pro Gin Sep Pro Thr Pro Asp Glu Ala Glu 

-loO -155 

Ala Thr Thr Met val Glu Ala lph rir. a-„ a- . 

-150 a Le y e G1n Ar 9 Asp Leu Gly Leu Ser 

-145 _ 140 

Pro ser Gin Ala Asp Glu Leu Leu Glu Ala Gin Ala Glu ser Phe 

Page 18 



60 
120 



<210> 32 1042 3 - 020-DK.ST25.txt 

<211> 1068 
<212> DNA 

<213> Nocardiopsis Alba DSM 15647 
<400> 32 

Qcgaccggcc ccctccccca gtcccccacc ccggatgaag ccgaggccac caccatggtc 
gaggccctcc agcgcgacct cggcctgtcc ccctctcagg ccgacgagct cctcgaggcg _ 
caggccgagt ccttcgagat cgacgaggcc gccaccgcgg ccgcagccga ctcctacggc 180 
ggctccatct tcgacaccga cagcctcacc ctgaccgtcc tggtcaccga cgcctccgcc 240 
9tcgaggcgg tcgaggccgc cggcgccgag gccaaggtgg tctcgcacgg catggagggc 300 
ctggaggaga tcgtcgccga cctgaacgcg gccgacgctc agcccggcgt cgtgggctgg 360 
taccccgaca tccactccga cacggtcgtc ctcgaggtcc tcgagggctc cggtgccgac 420 
gtggactccc tgctcgccga cgccggtgtg gacaccgccg acgtcaaggt ggagagcacc 480 
accgagcagc ccgagctgta cgccgacatc atcggcggtc tcgcctacac catgggtggg 
cgctgctcgg tcggcttcgc ggccaccaac gcctccggcc agcccgggtt cgtcaccgcc 
ggccactgcg gcaccgtcgg caccccggtc agcatcggca acggccaggg cgtcttcgag ww 
cgttccgtct tccccggcaa cgactccgcc ttcgtccgcg gcacctcgaa cttcaccctg 720 
accaacctgg tcagccgcta caacaccggt ggttacgcga ccgtctccgg ctcctcgcag 780 
9cggcgatcg gctcgcagat ctgccgttcc ggctccacca ccggctggca ctgcggcacc 840 
gtccaggccc gcggccagac ggtgagctac ccccagggca ccgtgcagaa cctgacccgc 900 
accaacgtct gcgccgagcc cggtgactcc ggcggctcct tcatctccgg cagccaggcc 960 
cagggcgtca cctccggtgg ctccggcaac tgctccttcg gtggcaccac ctactaccag 1020 
gaggtcaacc cgatgctgag cagctggggt ctgaccctgc gcacctga 



S40 
600 
660 



1068 



-135 ^23- 020-DK.ST25.txt 

SI- lie « §o Glu Ala Ala Thr Ala,. A !a »u Ala «„ ^ Tyr Gly 
Gly Ser Phe Asp Thr Asp se^ Leu Thr Le<1 nr V} , Leu ya , ^ 



-95 



Asp Ala ser Ala val Glu Ala val Glu Ala Ala Gly Ala Glu Ala Lys 

°* -80 



Val val ser His Gly Met Glu Gly Leu Glu Glu He val Ala Asp Leu 

_65 -60 

Asn Ala Ala Asp Ala Gin Pro Gly val Val Gly Trp Tyr Pro Asp He 

-50 _ 45 

His ser Asp Thr val val Leu Glu val Leu Glu Gly ser Gly Ala Asp 

-30 

val Asp ser Leu Leu Ala Asp Ala Gly val Asp Thr Ala Asp val Lys 

val Glu ser Thr Thr Glu Gin Pro Glu Leu Tyr Ala Asp He He Gly. 

. -11 5 

Gly Leu Ala Tyr Thr Met Gly Gly Arg cys ser Val Gly Phe Ala Ala 

15 20 

Thr Asn Ala ser Gly Gin Pro Gly Phe Val Thr Ala Gly His Cys Gly 

Thr val Gly Thr Pro val Ser lie Gly ash Gly Gin Gly val Phe Glu 

50 

Arg ser Val Phe Pro Gly Asn Asp ser Ala Phe val Arg Gly Thr ser 

Asn Phe Thr Leu Thr Asn Leu val ser Arg Tyr Asn Thr Gly Gly Tyr 

Ala Thr val ser Gly ser ser Gin Ala Ala He Gly ser Gin He cys 

95 100 
Ar g Ser Gl y ser Thr Thr Gly Trp ^ g Cys sly Thp ^ ^ ^ ^ 

Gly Gin Thr val ser Tyr P ro g„ Gly ^ val G ,„ As „ 

125 130 s 

Thr Asn val Cys Ala Glu Pro G ly Asp Ser Gly Gly ser Phe He Ser 

Page 19 



135 140 1042 3- 020-DK.ST25.txt 

u 145 

gg ser «,. A1 . Gln & val Thp ser Gly g ^ ^ n ^ ^ 

- «V <n, T„ r jjj Tyr Tyr «i. Clu Va1 Asn , r0 „ Leu ser ser 

■ L/:> 180 

. Trp Gly Leu Thr Leu Arg Thr 
185 

<210> 34 
<2ll> 43 
<2l2> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1421 
<400> 34 

Sttcatcgat cgcatcggct gcgaccggcc ccctccccca gtc 

<210> 35 

<211> 31 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1604 

<400> 35 

gcggatccta tcaggtgcgc agggtcagac c 

<210> 36 
. <211> 1062 
<212> DNA 

<213> Nocardiopsis prasina DSM 15648 
<400> 36 

9 ccac cggac cgctcccra gtracccacc ccsgaggcc9 
cc tccagc gcgacctcgg cct9accccg cttgaggcca 

:::: <~ ~° « mgacgc :: acgcg 

tcc 9 tcttcg acacc g a g ac cct gg aact g acc 9 tcct 99 tcacc 9 ac 9 c c 9 cc t c 9 otc 
90 c t9t99 a 99 ccacc 99 c gcg9gt ac c g aactc gtct c «a cggc a t c a 9 

tcca99atct caac9cc9cc ~ tc < » 

cetera, c 999 t 9 acac c 9 tc 9 t«t 9 g a gg tccx gg a gggttccgg agccgacgtg 
,c 99 cc t9 c tc 9 « 9 ac 9 c c 99 c 9t99 ac g c« cggccg tcgaggtgac 

izz:: r tac9cc9a catcatc " c " tctM <" — 

c 99 t cggat tcgcggccac caacgccgcc ^^^^^ 

99 c 9 c 9 t999 caccca 9gtg a gC a« gg caac gg cc a ggg c gt «t C9 a 9 ca 9t cc 
«c«ccc gg 9C aac 9 ac 9 c c 9 c« t c 9 tc c gCggC a cgt ccaacttcac gctgaccMc 72Q 



43 



31 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



10423 . 020-OK . ST25 . txt 

ctggtcagcc gctacaacac cggcggttac gccaccgtcg ccggccacaa ccaggcgccc 780 

atcggctcct ccgtctgccg ctccggctcc accaccggct ggcactgcgg caccatccag 840 

gcccgcggcc agtcggtgag ctaccccgag ggcaccgtca ccaacatgac ccggaccacc 900 

gtgtgcgccg agcccggcga ctccggcggc tcctacatct ccggcaacca ggcccagggc 960 

gtcacctccg gcggctccgg caactgccgc accggcggga ccaccttcta ccaggaggtc 1020 

acccccatgg tgaactcctg gggcgtccgt ctccggacct aa 106 2 

<210> 37 

<211> 353 

<212> prt 

<213> Nocardiopsis prasina dsm 15648 
<220> 

<22l> PROPEP 

<222> (1) . . (165) 

<220> 

<221> mat_peptide 
<222> (1665.. C353) 

<400> 37 , • 

-165 Thr Gly Pr ° Leu Gln ser Pro Thr p _™ 5 Glu Ala Asp Ala 

Val ser Met Gin Glu Ala Leu Gin Arg Asp Leu Gly Leu Thr Pro 

~ J/ ° -140 

-135 6lU Ala Asp Glu L ?y n Leu Ala Ala Gin Asp Thr Ala Phe Glu 

"IjU -125 

-120 ASP G1 ° Ala Ala A n 1a c Ala Ala Ala CIV A sp Ala Tyr Gly Gly 

- L - L ^ -110 

ser s val Phe Asp Thr g1u q Thr Leu-Glu Leu Thr val Leu val Thr Asg 

Ala Ala ser val Glu Ala Val Glu Ala Thr Gly Ala Gly Thr Glu Leu 

-80 _75 

val ser Tyr Gly lie Glu Gly Leu Asp Glu He lie Gin Asp Leu Asn 

-65 -60 

Ala Ala A|p Ala val Pro Gly val val Gly Trp Tyr Pro Asp val Ala 

Gly Asp Thr val Val Leu Glu val Leu Glu Gly Ser Gly Ala Asp Val 

ser Gly Leu Leu Ala Asp Ala Gly Val Asp Ala Ser Ala val Glu val 

-15 -10 
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10423.020-DK.ST2S.txt 
Thr ser Ser Ala Gin Pro Glu Leu Tyr Ala Asp He n e Gly Gly Leu 

Ala Tyr Thr Met Gly Gly Arg cys Ser Val Gly Phe Ala Ala Thr 



20 

Ala Ala Gly Gin Pro Gly Phe val Thr Ala Gly His Cys Gly Arg val 

Gly Thr Gin Val Ser lie Gly Asn Gly Gln fly ^ phe ^ ^ ^ 

50 55 

He Phe Pro Gly Asn Asp Ala Ala Phe val Arg Gly Thr ser Asn Phe 

65 70 

Thr Leu Thr Asn Leu val ser Arg Tyr Asn Thr Gly Gly Tyr Ala Thr 

val Ala Gly His Asn Gin Ala Pro rle Gly ser ser val cys Arg ser 

Gly ser Thr Thr Gly Trp His Cys Gly Thr lie Gin Ala Arg Gly Gin 

Ser val ser Tyr Pro Glu Gly Thr val Thr Asn Met Thr Arg Thr Thr 

A " 130 135 

val cys Ala Glu Pro Gly as p ser Gly Gly ser Tyr He ser Gly Asn 

Gin Ala Gin Gly val Thr ser Gly Glv ser Gly Asn cys Arg Thr Gly 

160 165 

Gly Thr Thr Phe Tyr Gin Glu val Thr Pro Met val Asn ser Trp Gly 

ISO 

val Arg Leu Arg Thr 
18 5 

<210> 38 

<211> 43 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1346 

<400> 38 

gttcatcgat cgcatcggct gccaccggac cgctccccca gtc 



<210> 39 
<211> 38 



Page 22 



38 



<212> DMA 10423.020-DK.ST25.txt 
<213> Artificial sequence 

<220> 

<223> Primer 1602 
<400> 39 

gcggatccta ttaggtccgg agacggacgc cccaggag 

<210> 40 
<211> 1062 
<212> DNA 

<213> Nocardiopsis prasina dsm 15649 
<400> 40 

gccaccggac cactccccca gtcacccacc ccggaggccg acgccgtctc catgcaggag 60 

gcgctccagc gcgacctcgg cctgaccccg cttgaggccg atgaactgct ggccgcccag 120 

gacaccgcct tcgaggtcga cgaggccgcg gccgaggccg ccggtgacgc ctacggcggc 180 

tccgtcttcg acaccgagac cctggaactg accgtcctgg tcaccgactc cgccgcggtc 240 

gaggcggtgg aggccaccgg cgccgggacc gaactggtct cctacggcat cacgggcctc 300 

gacgagatcg tcgaggagct caacgccgcc gacgccgttc ccggcgtggt cggctggtac 360 

ccggacgtcg cgggtgacac cgtcgtgctg gaggtcctgg agggttccgg cgccgacgtg 420 

ggcggcctgc tcgccgacgc cggcgtggac gcctcggcgg tcgaggtgac caccaccgag 480 

cagcccgagc tgtacgccga catcatcggc ggtctggcct acaccatggg cggccgctgt 540 

tcggtcggct tcgcggccac caacgccgcc ggtcagcccg ggttcgtcac cgccggtcac 600 

tgtggccgcg tgggcaccca ggtgaccatc ggcaacggcc ggggcgtctt cgagcagtcc 660 

atcttcccgg gcaacgacgc cgccttcgtc cgcggaacgt ccaacttcac gctgaccaac 720 

ctggtcagcc gctacaacac cggcggctac gccaccgtcg ccggtcacaa ccaggcgccc 780 

atcggctcct ccgtctgccg ctccggctcc accaccggtt ggcactgcgg caccatccag 840 

gcccgcggcc agtcggtgag ctaccccgag ggcaccgtca ccaacatgac gcggaccacc 900 

gtgtgcgccg agcccggega ctccggcggc tccwttct ccggcaacca ggcccagggc 960 

gtcacctccg gcggctccgg caactgccgc accggcggga ccaccttcta ccaggaggtc 1020 
acccccatgg tgaactcctg gggcgtccgt ctccggacct aa 

<210> 41 

<211> 353 

<212> PRT 

<213> Nocardiopsis prasina dsm 15649 
<220> 

<221> PROPEP 

<222> (1) . . C165) 

<220> 

<22l> mat_peptide 
<222> C166)..(353) 

<400> 41 

Page 23 



1062 



10423.020-0K.ST25.txt 
AH Thr fly Pro Le „ ^ „. sep pro Thr ^ Ma a ^ 

val o ser Met Gin Glu Ala. Leu 61 „ Arg Asp ^ Gly Leu Thr pr<> 

LJU 5 Glu Ala Asp Glu Leu Ala Ala Gl „ ^ Thr AU 

Val o As p Glu A la Ala ^ Glu Ala A la Gly ^ A la Tyr Gly Gly 

-tos Va1 Phe ASP ?1S 0 «* 01" U. Thr val Leu val Thr Asp 

y5 -90 

ser Ala Ala val Glu Ala val Glu Ala Thr Gly A la Gly Thr Glu Leu 

-80 _ 7S 

Va! ser Tyr ^ H e Thr Gly Leu Asp Glu He Val Glu Glu Leu Asn 

" 65 -60 

Ala Ala Asp Ala val Pro Gly val val Gly Trp Tyr Pro Asp val Ala 

Gly Asp Thr val val Leu Glu val Leu Glu Gly ser Gly Ala Asp Val 

flv Gly Leu Leu Ala Asg Ala Gly val Asp Ala Ser Ala val Glu val 

" 15 -10 
Thr Thr Thr Glu Gin Pro Glu Leu Tvr Ala Asp He lie Gly Gly Leu 

Ala Tyr Thr Met Gly Gly Arg cys Ser val Gly Phe Ala Ala. Thr Asn 

x:> 20 

Ala Ala Gly Gin Pro Gly Phe val Thr Ala Gly His Cys-Gly Arg val 



20 

,_ e v al Thr Ala Gly Hi- 
30 35 

Gly Thr Gin val Thr lie G ly Asn Gly Arg Gly val Phe Glu Gin 



45 u •/ y vai pne Glu Gin Ser 



He Phe Pro Gly «„ «p Ala Ala Phe val A rg Gly Thr ser «n Phe 

65 70 

Thr Leu Thr A| n Leu Val ser A r g Tvr A sn Thr Gly 01y jyr A la Thr 
val A la Gly His «„ Gin Ala Pro He Gly ser ser VaJ cys Arg ser 

Page 24 



10423.020-DK.ST25.txt 
Gly Ser Thr Thr Gly Trp His c ys Gly Thr He Gin Ala 



ii6 -jr mr x.e Gin Ala Arg cly Gin 



Ser val sen Tyr Pro Glu Gly Thr Val Thr Asn Met Thr Arg Thr Thr 

130 335 

val cys Ala Glu Pro cly Asp ser G1 y cly 5er ^ Ilc scp ^ Asn 
Gin Ala Gin cly val Thr ser cly cly S er G1y As „ Cys „ ? T „ r ^ 

Cly Thr Thr rhe Tyr Gin Glu val Thr Pro M et val Asn ser Trp Gly 

val Arg Leu Arg Thr 
185 

<210> 42 

<211> 43 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1603 

<400> 42 

gttcatcgat cgcatcggct gccaccggac cactccccca gtc 

<210> 43 

<211> 353 

<212> prt 

<213> Nocardiopsis sp. nrrl 18262 
<220> 

<221> PROPEP 

<222> (D..C165) 

<220> 

<22l> mat_peptide 
<222> (1665.. (10S9) 

<400> 43 



Al| 5 Thr Gly Ala Leu Pro, cln ser Pro Thr „ Glu Ala Asp A „ 
«J 0 ser Met Gl„ Glu Ala. L eu Gin Arp Asp Le^ Asp Leu Thr ser 
-lh GlU AU G1 " G,u IrJSo 'la Ala Gin Asp,. Thr Ala Phe Glu 
YJ} 0 asp Glu Ala Ala AU Glu Ala Ala Gly Asp Ala Tyr Gly Gly 



-110 
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10423.020-DK.ST25.txt 

-105 PHe ASP ThP Ser Le « «1« «-eu Thr Val Leu Val Thr Asp 

~ 95 -90 

Ala Ala Ala val Glu Ala val Glu Ala Thr Gly Ala Gly Thr Glu Leu 



val ser Tyr Gig He Asp Gly Leu Asp G lu lie val Gin Glu Leu Asn 

-65 _ 60 

Ala Ala Asp Ala val Pro Gly yal val Gly Trp Tyr Pro Asp val Ala 

Gly asd Thr val val Leu Glu val Leu Glu Gly ser Gly Ala Asp Val 

7" -30 

Ser Gly Leu Leu Ala Asp Ala Gly val Asp Ala Ser Ala Val Glu Val 

-15 _ 10 

Thr Thr ser Asp Gin Pro Glu Leu Tyr Ala Asp He He Gly Gly Leu 

Ala Tyr Thr Met Gly Gly Arg cys Ser val Gly Phe Ala Ala Thr Asn 

Ala Ala Gly Gin Pro Gly Phe val Thr Ala Gly His cys Gly Arg val 

Gly Thr Gin val Thr lie Gly Asn Gly Arg Gly val Phe Glu Gin ser 

50 55 

val Phe Pro Gly Asn Asp Ala Ala Phe Val Arg Gly Thr ser Asn Phe 

6 5 70 

Thr Leu Thr Asn Leu val Ser Arg Tyr Asn Thr Gly Gly Tyr Ala Thr 

80 85 

val Ala Gly His Asn Gin Ala Pro He Gly ser ser val cys Arg Ser 

yD 100 

Gly ser Thr Thr Gly Trp His cys Gly Thr He Gin Ala Arg Gly Gin 

115 

Ser val Ser Tyr Pro Glu Gly Thr val Thr Asn Met Thr Arg Thr Thr 

5 130 135 . 

val cys Ala Glu Pro Gly Asp ser Gly Gly ser. Tyr He Ser Gly Thr 

145 150 

Gln.Ala Gin Gly Val Thr ser Gly Gly ser Gly Asn cys Arg Thr Gly 

Page 26 



10423.020-DK.ST25.txt 
Gly Thr Thr Phe Tyr Gin Glu val n,. « 

17Q y Glu jal Thr Pro Met val A|n Ser Trp Gly 

Va1 i5f Leu Ar9 Thr 



<210> 44 

<211> 1164 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Synthetic protease encoding gene 
<220> 

<221> cos 

<222> (1) . . C ii 64:) 

<223> Full length protease 

<220> 

*||1> sig_peptide 

<222> (15..C81) 

<220> 

^If^ "?£sc_feature 

<|22> (82) . . C1164) 

<223> Propeptide 

<220> 

<221> mat_peptide 
<222> C5775.Tail4) 

<400> 44 

S3 K K S3 ES I?J gj ffi fif 2ft II? « K ffi S « 

"- 185 -180 

att tea gt 



» % «: as ii? s s "i xa e K m & m *> 

~- L/0 -165 

ffi S S3 SK S pfl lh? a R H- st iff fif is? 33 i» 

~ 155 -150 

a ai iff 2i si ??i as q js ffi - g 180 

~ J - 40 -135 

K R! Sj 0 2S Si St §?S 8| ?B K £ ft 89 SS R 2 " 

-" 5 -120 

Si SI Iff I?S Iff Iff fl- l« ffl « g?J g? c « ? « gj 270 

" J " L0 -105 

as s? k is a «t ss ffi @ a & «- ??; ffi si. 

83 ffi Iff ?|f S5 Iff fl- ? ca „ aca «. ctt gtt tea tat 366 
-85 _ gQ w, y Aia Gly Thr val Leu Val ser Tyr 



-75 
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gga att 
Gly He 
-70 

get gtt 
Ala val 



10423.020-DK.ST25.txt 



gat ggc 
Asp Gly 



ccg ggc 
Pro Gly 



gtt gtc 
val val 



ctg gca 
Leu Ala 



gat caa 
Asp Gin 
-5 

at g ggc 

Gl 



ctt gaa 
Leu Glu 
"35 

gac gca 
Asp Ala 
-20 

ccg gaa 
Pro Glu 



ctt gat gaa 
Leu Asp Glu 
-65 

gtt gtt ggc 
val val Gly 
-50 

gtt ctt gaa 
val Leu Glu 



att gtt 
lie val 



tgg tat 
Trp Tyr 



gga tea 
Gly ser 
-30 



caa gaa ctg aat 
Gin Glu Leu Asn 
-60 

ccg gat gtt get 
Pro Asp val Ala 
-45 

ggc gca gat gtt 
Gly Ala Asp val 



gga gtc gat gca tea 
Gly val Asp Ala ser 
-15 



Met Gly Sty Arg 



ggc aga 

~il> 



caa ccg 
Gin Pro 



gtt tea 
Val ser 



ggc aat 
Gly Asn 
60 

aat ctg 
Asn Leu 
75 

cat aat 
His Asn 



ggc ttt 
Gly Phe 
30 

att ggc 
He Gly 
45 

gat tea 
Asp ser 



ctt tat gca 
Leu Tyr Ala 
-1 1 

tgc age gtt 
Cys ser Val 

gtt aca gca 
Val Thr Ala 



gat att 
Asp lie 



ggc ttt 
Gly phe 



aat ggc aaa 
Asn Gly Lys 



gtt tea 
val ser 



caa gca 
Gin Ala 



gca ttt gtt 
Ala Phe val 
65 

aga tat aat 
Arg Tyr Asn 
80 

ccg att ggc 
Pro He Gly 
95 y 



ggc cat 
Gly His 
35 

ggc gtt 

Gly val 
50 

aga ggc 
Arg Gly 



gca gtt 
Ala val 



att ggc 
lie Gly 
5 

gca gca 
Ala Ala 
20 

tgc ggc 
Cys Gly 



gaa gtt 
Glu Val 
-10 



gca get gat 
Ala Ala Asp 
-55 

gga gat aca 
Gly Asp Thr 
-40 

tea ggc ctg 
Ser Gly Leu 
-25 

aca aca tea 
Thr Thr ser 



ggc ctg 
Gly Leu 



aca aat 
Thr Asn 



aca gtt 
Thr val 



ttt gaa 
Phe Glu 



aca tea 
Thr ser 



tea ggc 
Ser Gly 



tea gca 
Ser Ala 



Th? 2? C i" ??* 9gc aca 
Thr Gly Trp His cys Gly Thr 

110 



tat ccg 
Tyr pro 



gaa ccg 
Glu Pro 
140 

ggc gtt 
Gly val 
155 

tat tac 
Tyr Tyr 



caa ggc 
Gin Gly 
125 

ggc gat 
Gly Asp 



aca gtt tat 
Thr Val Tyr 



aca tea 
Thr ser 



caa gaa 
Gin Glu 



aga aca 
Arg Thr 



caa teg 
Gin ser 
190 



tea ggc ggc 
Ser Gly Gly 
145 

ggc ggc tea 
Gly Gly ser 
160 

gtt aat ccg 
Val Asn Pro 
175 

cat gtt caa 
His val Gin 



att caa 
lie Gin 
115 

agt ctg 
Ser Leu 
130 

tea tat 
Ser Tyr 



ggc tat 
Gly Tyr 
85 

gtt tgc 
val cys 
100 

gca aga 
Ala Arg 



cga age 
Arg ser 
55 

aat ttt 
Asn Phe 
70 

gca aca 
Ala Thr 



gca tat tat 
Ala Tyr Tyr 
10 

gca tea ggc 
Ala ser Gly 
25 

ggc aca cca 
Gly Thr Pro 
40 

att ttt ccg 
lie Phe Pro 



aca ctt aca 
Thr Leu Thr 



aga tea 
Arg ser 



aat caa 
Asn Gin 



aca aga aca aca 
Thr Arg Thr Thr 
135 

att age ggc act 
lie ser Gly Thr 
150 



ggc aat 



tgc agt get ggc 

~" 3l\ 



Gly Asn cys sir Ala gTv 
165 



atg ctt 
Met Leu 



tec get 
Ser Ala 
195 



agt tea tgg ggc 
Ser ser Trp Gly 
180 

cca 
Pro 
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gtt gca ggc 
val Ala Gly 
90 

ggc tea aca 
Gly ser Thr 
105 

aca gtt agg 
Thr val Arg 
120 

gtt tgt gca 
Val cys Ala 



caa gca caa 
Gin Ala Gin 



ggc aca aca 
Gly Thr Thr 
170 

ctt aca ctt 
Leu Thr Leu 
185 



414 
462 
510 
558 
606 
654 
702 
750 
798 
846 
894 
942 
990 
1038 
1086 
1134 
1164 



10423.020-DK.ST25.txt 



<210> 45 

<211> 388 

<212> prt 

<213> artificial sequence 
<220> 

<223> synthetic Construct 

<400> 45 



Met Lys L_y k Pr „ Leu Gly Lys ^ va , AU sep flir ^ l ^ 

116 " r -& AU Phe »«■ ~ % n. AT ser Ala AT Thr G!y 

AT Leu Pro, <n„ ser pro rtr G , u AU s ^ 

155 -150 

G1 " G1 " ?& ^ G1 " Ar » "P * «P C Thr ser AT 61 u AT 

GT Glu L go Leu AT AT Gin «p Thr AT Phe Glu val Asp Glu 

" -120 

Ala Ala Ala Glu Ala Ala riw A«r~ 

-115 * Ala Gly A f? 0 Ala Tyr Gly Gly Ser Val Phe 

110 -105 

Asp Thr ; t ser Leu Glu Leu Thr va, ueu val Thr asp Ala AT Ala 

" -90 

val Glu AT val Glu Ala Thr Gly at Gly Thr val Leu Va! ser Tyr 
?Jv IT Asp Gly Leu Asp Glu XT va! cm <*„ Leu Asn Ala Ala As 

-60 _ 5 g 

AT val Pro Gly val Val Gly Trp Tyr Pro Asp val Ala aly asp Thr 

-40 

val val Leu Glu val Leu Glu sly Sfr 01y aT Asp Val ser Gly L e„ 

L eu Ala asp Ala Gly val Asp Ala ser Ala val. Glu val Thr Thr ser 

3 -10 

Asp Gin Pro Glu Leu Tyr Ala Asn T i a n 

-5 i a AS P «e lie Gly Gly Leu Ala Tyr Tyr 

5 10 
Met Gly Gly A r 9 cys ser val c , y Pnc g . A „ ^ As „ ^ ^ ^ 



10 

j Ala Thr Asn Ala Ser 
20 25 
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«. Pro 0, y Pj e v„ Tor A1 , c, y Thr pro 

" 40 
Val Ser lie Gly Asn Gly Lys Gly Val phe ^ Apg ^ ^ ^ ^ 

Gly Asn Asp ser Ala Phe val Arg Gly T hr Sen Asn Phe Thr Leu Thr 

OD 70 

A|n Leu val. Ser Arg Tyr Asn ser Gly 6 ly Tyr Ala Thr val Ala Gly 

85 90 

His Asn Gin Ala Pro He Gly Ser Ala Val Cys Arg Ser Gly ser Thr 

100 105 

Thr Gly Trp His Cys Gly Thr He Gin Ala Arg Asn Gin Thr val Arg 

J "° 120 

Tyr Pro Gin Gly Thr val Tyr Ser Leu Thr Arg Thr Thr val Cys Ala 

3.3 5 

Glu Pro Gly Asp ser Gly Gly ser Tyr He ser Gly Thr Gin Ala Gin ■ 

Cly val Thr ser Gly Gly ser Gly Asn cys ser Ala Gly Gly Thr Thr 

165 170 

Tyr T yr Gin Glu val Asn Pro Met Leu ser ser Trp Gly Leu Thr Leu 

Arg Thr Gin ser His val Gin ser Ala Pro 
190 195 



<210> 46 

<211> 165 

<212> prt 

<213> Artificial sequence 
<220> 

<223> Shuffled pro-peptide 0-2.19 
<220> 

<221> PROPEP 

<222> CD . . C165) 

<400> 46 



Ala Thr Gly Ala Leu Pro Gin ser Pro Thr Pro Glu Ala Asp Ala val 

10 15 

ser Met Gin Glu Ala Leu Gin Arg Asp Leu Asp Leu Thr ser Ala Glu 

25 30 

Ala Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu val Asp Glu 
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35 lW23.020-DK.ST25.txt 
" 40 45 

Ala Ala Ala Ala Ala Ala Gly Asp Ala Tyr Gly Gly Ser val Phe Asp 

Thr clu Ser Leu Thr Leu Thr val Leu val Thr Asp Ala Ser Ala Val 

75 80 

Glu Ala val Glu Ala Ala Gly Ala Glu Ala L ys val val ser His Gly 

Met Glu Gly Leu Glu Glu He Val Ala Asp Leu Asn Ala Ala Asp Ala 

Gin Pro Gly val Val Gly Trp Tyr Pro Asp He his Ser Asp Thr val 

val Leu Glu val Leu Glu Gly ser Gly Ala Asp val Asp ser Leu Leu 

i:5:> 140 

Ala Asp Ala Gly val Asg Ala ser Ala Val Glu val Thr Thr ser Asp 

155 160 

Gin Pro Glu Leu Tyr 
165 

<210> 47 
<211> 166 
<212> prt 

<213> Artificial sequence 
<220> 

<223> shuffled propeptide G-2.73 
<220> 

<221> PROPEP 
<222> (1) . . (166) 

<400> 47 

Ala Thr Gly Ala Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala val 

10 15 

ser wet Gin Glu Ala Leu Gin Arg Asp Leu Asp Leu ser Ser Ala Glu 
Ala Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu val Asp Glu 



Ala Ala Ala Gly Ala Ala Gly AS p Ala Tyr Gly Gly ser val Phe Asp 
Thr Glu Thr Leu Glu Leu Thr val Leu val Thr Asp Ala ser Ala val 



75 80 
Page 31 
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Glu Ala val Glu Ala Ala Glv *i a «-t -i 

8$ a Ala Gly A la Glu Ala Lys val val ser His Gly 

90 95 

Met Glu Gly Leu Glu Glu ile val *i, A , , 

100 1,6 Val tif ASp Leu Asn Al * Ala Asp Ala 

105 110 

sin Pro ^ val val G , y Trp T pro fl . ^ ^ ^ 

* u 125 

val val «,„ val LeU Glu gy ser 0 i y Ala « p V?J „ p Ser ^ ^ 

Aja Asp A ,a Gl„ val A |g Thr Al . Asp val ya , ^ ^ ^ 



Glu Gin Pro Glu Leu Tyr 
165 

<210> 48 
<2ll> 166 
<212> prt 

<213> Artificial sequence 
<220> 

<223> shuffled propeptide G-l.43 
<220> 

<22l> PROPEP 
<222> (1) - - (166) 



<400> 48 

Ala Thr Sly A1 , , eu pro 61 „ ser pro ppo au 

10 15 
ser „et Ola Glu Ala Leu Gln Arg ^ oly ^ ^ ^ ^ ^ 

" 30 
Ala Glu f lu Leu Leu Asp AU gi) ^ ^ ^ ^ u 

Ala Ala Ala A ,a A ,a Ala |l y Asp Ala Tyr sly sar fl# phe 

60 

Thr AS p ser L eu Thr tea Thr val Leu val Thr A sp A ,a Ser Ala val 

75 80 
Clu Ala val Glu Ala Ala Gl y Ala slu & Lys w , ^ Ms 

90 95 

Met Glu Gly Leu Glu Glu He Va i Air . A 

100 Ile Va1 A Jf AS P >-eu Asn Ala Ala Asp Ala 

105 110 
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10423 . 020-DK . ST25 . txt 
Gin Pro Gly val Val Gly Trp Tyr Pro Asp lle His ser Asp Thr val 

val Leu Glu val Leu Glu Gly ser Gly Ala Asp val Asp ser Leu Leu 

140 

Ala Asp Ala Gly val Asp Thr Ala Asp val Lys val Glu ser Thr Thr 

155 160 

Glu Gin Pro Glu Leu Tyr 
165 

<210> 49 
<211> 166 
<212> prt 

<213> Artificial sequence 
<220> 

<223> Shuffled propeptide G-2.6 
<220> 

<221> propep 
<222> (1) . . (166) 

<400> 49 

Ala Thr Gly Ala Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala val 

ser Met Gin Glu Ala Leu Gin Arg Asp Leu Asp Leu Thr ser Ala Glu 

" 30 

Ala Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu val Asp Glu 

Ala Ala Ala Ala Ala Ala Gly Asp Ala Tyr Gly Gly ser n e Phe Asp 

Thr Glu Thr Leu Glu Leu Thr val Leu val Thr Asp ser Ser ser Val 

75 80 

Glu Ala val Glu Ala Ala Gly Ala Glu Ala Lys val val ser His Gly 

90 95 

Met Glu Gly Leu Glu Glu lie val Ala Asp Leu Asn Ala Ala Asp Ala 

xu:> HO 

<nn Pro va, v al sly Trp g Pr0 Asp ^ ^ ^ 

u 125 

Val Leu Glu Val Leu Glu Glv s P r riw A i, a- '„ -i 

130 VX 5er G1 y Ala As P val Asp ser Leu Leu 

" 140- 
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Ala Sly M. 6ly val « g ^ .^^fvaV^ ser Thr xHr 



160 



Glu Gin Pro Glu Leu Tyr 
165 

<210> 50 
<211> 165 
<212> prt 

<213> Artificial sequence 
<220> 

<223> shuffled propeptide G-2.5 
<220> 

<221> propep 
<222> (1)..(165) 

<400> 50 

Ala Thr Gly Ala Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala val 

10 15 

Ser Met Gin Glu Ala Leu Gin Arg Asp Leu Gly Leu Thr Pro Leu Glu 
u 25 30 

Ala Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu val Asp Glu 

Ala Ala Ala Glu Ala Ala Gly AS p Ala Tyr Gly Gly ser Val Phe Asp 

Thr Glu Thr Leu Glu Leu Thr Val Leu Val Thr Asp Ala Ser Ala val 

/U 7 ? 80 

Glu Ala val Glu Ala Ala Gly Ala Glu Ala Lys val val ser His Gly 
* 90 95 

Met Glu Gly Leu Glu Glu He val Ala Asp Leu Asn Ala Ala Asp Ala 
w 105 HO 

Gin Pro Gly val val Gly Trp Tyr Pro Asp He His ser Asp Thr val 

val Leu Glu val Leu Glu Gly ser ply Ala Asp val Asp Ser Leu Leu 

Ala Asp Ala Gly val Asp Ala Ser Ala val Glu val Thr Pro Ala Ala 

155 160 

Arg pro Glu Leu Tyr 
165 

<210> 51 
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<211> 166 10423.02O-DK.ST25.txt 

<212> prt 

<213> Artificial sequence 
<220> 

<223> shuffled propeptide G-2.3 
<220> 

<221> PROPEP 

<222> (1)..(166) 

<400> 51 

Ala Thr Gly Ala Leu Pro Gin ser Pro Thr Pro Asp Gly Ala Glu Ala 

10 15 

Thr Thr Met yal Glu Ala Leu Gin Arg Asp Leu Gly Leu Thr Pro Ala 

25 30 

Glu Ala Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu val Asp 

u 45 

Glu Ala Ala Ala Ala Ala Ala Gly Asp Ala Tyr Gly Gly Ser He Phe 

" 60 

asp Thr Asp ser Leu Thr Leu Thr val Leu val Thr Asp Ala Ala Ala 

75 80 

val Glu Ala Val Glu Ala Ala Gly Ala gTu Ala Lys Val val ser His 

90 95 

Gly Met Glu Gig Leu Glu Glu He val Ala Asp Leu Asn Ala Ala Asp 

Ala val Pro Gly val val Gly Trp Tyr Pro Asp val Ala Gly Asp Thr 

x^u 125 

val val Leu Glu val Leu Glu Gly ser Gly Ala Asp val Tyr Ser Leu 

140 

Lju Ala Asp Ala Gly val Asp Ala ser Ala val Glu val Thr Pro Ala 

155 



160 



Ala Gin Pro Glu Leu Tyr 
165 

<210> 52 
<211> 166 
<212> prt 

<213> Artificial sequence 
<220> 

<223> shuffled propeptide G-1.4 
<220> 
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<221> propep 10423. 020-DK.ST25.txt 

. <222> CD . . (166) 

<400> 52 

Ala Thr cly Ala Leu Pro cl„ ser Pro Thr Pro clu Ala A sp Ala val 

10 15 

sar «et cl„ clu Ala Leu cl„ A rg Af p Leu cly Leu ser ser Ser do 
Ala clu |1„ L eu Leu Asp Ala cl„ Ala clu ser Phe clu lle Asp 61u 

A!a Ala Ala Ala Ala Ala Ala Asp Sar T y r «31 y cly Ser He Phe Asp 

60 

Jhr Asp ser Leu Thr Leu Thr val Leu val Thr Asp Ala ser Ala val 

75 80 

Clu Ala val clu Ala Ala cly Ala clu Ala Lys va, val ser His cly 

90 95 

«at clu cly Leu clu clu Ile val Ala Asp Leu Asn Ala Ala Asp Ala 

Ola Pro Cly v,l val cly Trp Tyr Pro Asp He His Ser Asp Thr val 

val Leu Clu val Leu clu cly ser cly Ala Asp Val Asp ser Leu Leu 

x " 140 

Al| Asp Ala Gly val Asp rhr Ala Asp val uys val 6 1 U ser Thr Thr 

' 155 ifn 



Glu 61 n Pro clu Leu Tyr 
165 

<210> 53 
<211> 166 
<212> prt 

<213> Artificial sequence 
<220> 

<223> shuffled propeptide G-1.2 
<220> 

<22l> PROPEP 
<222> (l) . . ( 166) 

<400> 53 

Ala Thr cly Ala Leu Pro do ser Pro Thr Pro clu Ala Asp Ala val 

10 15 
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Ser wet Gin 61 u Ala Leu Gin Ara ls 3 r/?i 0 ~ DK ' ST25 - txt 

2Q ueu b.n Arg Asp Leu Asp Leu Thr ser Ala clu 

^ 30 



Clu clu La „ Leu Ala Ala g „ Asp Thp au wi 

45 

Ala Ma Ala Al, Al a ? l y Asp Ala Tyr Gly gy ^ ^ ^ 



Thr «,„ r*r Leu clu Leu Thr val L eu val Thr Asp Ser ser ser 

75 80 
«• Ala val clu Ala Ala 01y A „ „„ Ly£ ^ ^ ^ 

90 95 



«t clu cl y Leu clu Glu lle yal & wp a ^ au ^ ^ ^ 

- P~ c, y val val c, y T r p jg Pro Asp n . „ js ffi ^ ^ ^ 
«1 jg c,u val Lau „„ fi ser Gly AU wp Aw ^ ^ 



140 

Ala Gly Ala Gly val Asp Thr Ala Asp val 



ISO M ' a ASP Val V^f Val Glu Ser Thr Thr 

" 5 160 



Glu Gin pro Glu Leu Tyr 
165 
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